FIGURE i 



GGGGCTTCGGCGCCAGCGGCCAGCGCTAGTCGGTCTGGTAAGGATTTACAAAAGGTGCAGGTATG 
AGCAGGTC T GAAGACT AACAT T T TGTGAAGT T GTAAAACAGAAAACCTGT TAGAAATGT GG TGGT 
TTCAGCAAGGCCTCAGTTTCCTTCCTTCAGCCCTTGTAATTTGGACATCTGCTGCTTTCATATTT 
TCATACATTACTGCAGTAACACTCCACCATATAGACCCGGCTTTACCTTATATCAGTGACACTGG 
TACAGTAGCT CCAGAAAAATGCTTAT TT GGGGCAAT GCTAAATAT TGCGGC AGTT T TAT GCAT T G 
CTACCAT TTATGTTCGT TATAAGCAAGT TCATGCTCTGAGTCCTGAAGAGAACGTTATCATCAAA 
TTAJiACAAGGCTGGCCTTGTACTTGGAATACTGAGTTGTTTAGGACTTTCTATTGTGGCAAACTT 
CCAC5AAAACAACCCTTTTTGCTGCACATGTAAGTGGAGCTGTGCTTACCTTTGGTATGGGCTCAT 
TATATATGTTTGTTCAGACCATCCTTTCCTACCAAATGCAGCCCAAAATCCATGGCAAACAAGTC 
TTCTGGATCAGACTGTTGTTGGTTATCTGGTGTGGAGTAAGTGCACTTAGCATGCTGACTTGCTC 
ATCAGT T T TGCACAGTGGCAAT T T T GGGACTGAT T TAGAACAGAAAC TCCAT T GGAACCCCGAGG 
ACAAAGGTTAT GTGCTT CACAT GAT CACTAC TGCAGCAGAATGGTCTAT G TCAT TT T CCTTCT T T 
GG TTTTTTCC TGAC T T ACAT T C GT GAT T T T CAGAAAAT T T CT T T ACGGGT G GAAGC CAAT T T AC A 
TGGATTAACCCTCTATGACACTGCACCTTGCCCTATTAACAATGAACGAACACGGCTACTTTCCA 
GAGATAT T TGATGAAAGGATAAAATAT T TCTGTAAT GAT T ATGAT TCTCAGGGAT TGGGGAAAGG 
TTCACAGAAGT TGC T TATTCT TCTC TGAAAT T T TCAAC CAC T TAATCAAGGCT GACAGTAACAC T 
GATGAATGCTGATAATCAGGAAACATGAAAGAAGCCATT TGATAGAT TAT T CTAAAGGATATCAT 
CAA(3AAGACTATTAAAAACACCTAT GCCTATACTT TT T TATCTCAGAA?VAT AAAGTCAAAAGAC T 
ATG 



FIGURE 2 



<subunit 1 of 1, 266 aa, 1 stop 
<MW: 29766, pi: 8.39, NX(S/T): 0 

MWWFQQGLSFLPSALVIWTSAAFI FSYITAVTLHHIDPALPYISDTGTVAPEKCLFGAMLNIAAV 
LCIAT I YVRYKQVHALS PEENVI I KLNKAGLVLG I LSCLGLS I VANFQKT T L FAAHVSGAVLT FG 
MGSLYMFVQTILSYQMQPKIHGKQVFWIRLLLVIWCGVSALSMLTCSSVLHSGNFGTDLEQKLHW 
NPEDKGYVLHMITTAAEWSMSFSFFGFFLTYIRDFQKISLRVEANLHGLTLYDTAPCPINNERTR 




LLSRDI 



Important features: 

Type II transmembrane domain: 



amino acids 13-33 



Other Transmembrane domains: 



amino acids 54-73, 94-113, 160-180, 122-141 



N-myristoylation sites. 

amino acids 57-63, 95-101, 99-105, 124-130, 183-189 



ry 
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FIGURE a 

CGGACGCGTGGGCGGACGCGTGGGGGAGAGCCGCAGTCCCGGCTGCAGCACCTGGGAGAAGGCAGACC 
GTGTGAGGGGGCCTGTGGCCCCAGCGTGCTGTGGCCTCGGGGAGTGGGAAGTGGAGGCAGGAGCCTTC 
CTTACACTT CGCCATGAGTT TCCT CATC GACTCCAG CAT CATGATTACC TCCCAGATACT ATTT T TTG 
GATTTGGGTGGCTTTTCTTCATGCGCCAATTGTTTAAAGACTATGAGATACGTCAGTATGTTGTACAG 
GTGATCTTCTCCGTGACGTTTGCATTTTCTTGCACCATGTTTGAGCTCATCATCTTTGAAATCTTAGG 
AGTATTGAATAGCAGCTCCCGTTATTTTCACTGGAAAATGAACCTGTGTGTAATTCTGCTGATCCTGG 
TTT TCAT GGTGCCTT T TTACATT GGCTAT TTTATTGTGAGCAATATCCGAC TACTGCATAAACAACGA 
CTGCTTTTTTCCTGTCTCTTATGGCTGACCTTTATGTATTTCTTCTGGAAACTAGGAGATCCCTTTCC 
CATTCTCAGCCCAAAACATGGGATCTTAT CCATAGAACAGCTCATCAGCCGGGTTGGTGTGATT GGAG 
TGACTCTCATGGCTCTTCTTTCTGGATTTGGTGCTGTCAACTGCCCATACACTTACATGTCTTACTTC 
CTCAGGAATGTGACTGACACGGATATTCTAGCCCTGGAACGGCGACTGCTGCAAACCATGGATATGAT 
CATi^GCAAAAAGAAAAGGATGGCAATGGCACGGAGAAC^TGTTCCAGAAGGGGGAAGTGCATAACA 
AACCAT CAGGT T TCTGGGGAATGATAAAAAGTGTTAC CACTTCAGCAT CAGGAAGTGAAAATCT TACT 
CTTATTCAACAGGAAGTGGATGCTTTGGAAGAATTAAGCAGGCAGCTTTTTCTGGAAACAGCTGATCT 
ATATGCTACCAAGGAGAGAATAGAATACT CCAAAACC T TCAAGGGGAAATATTTTAATT TTCTT GGTT 
ACTTTTTCTCTATTTACTGTGTTTGGAAAATTTTCATGGCTACCATCAATATTGTTTTTGATCGAGTT 
GGGi^yy^ACGGATCCTGTCACAAGAGGCATTGAGATCACTGTGAATTATCTGGGAATCCAATTTGATGT 
GAAGTTTTGGTCCCAACACATTTCCTTCATTCTTGTTGGAATAATCATCGTCACATCCATCAGAGGAT 
TGCTGATCACTCTTACCAAGTTCTTTTATGCCATCTCTAGCAGTAAGTCCTCCAATGTCATTGTCCTG 
CTATTAGCACAGATAATGGGCATGTACTTTGTCTCCTCTGTGCTGCTGATCCGAATGAGTATGCCTTT 
AGAIVTACCGCACCATAATCACTGAAGTCCTTGGAGAACTGCAGTTCAACTTCTATCACCGTTGGTTTG 
ATGTGATCTTCCTGGTCAGCGCTCTCTCTAGCATACTCTTCCTCTATTTGGCTCACAAACAGGCACCA 
GAG^GCAAATGGCACCTTGAACTTAAGCCTACTACAGACTGTTAGAGGCCAGTGGTTTCAAAATTTA 
GATATAAGAGGGGGGAAAAATGGAACCAGGGCCTGACATTTTATAAACAAACAAAATGCTATGGTAGC 
ATT TTTCACCT T CATAGCAT ACTCCTTCC CCGT CAGGT GAT ACTATGAC CATGAGTAGCATCAGCCAG 
AACATGAGAGGGAGAACTAACTCAAGACAATACTCAGCAGAGAGCATCCCGTGTGGATATGAGGCTGG 
TGTAGAGGCGGAGAGGAGCCAAGAAACTAAAGGTGAAAAATACACTGGAACTCTGGGGCAAGACATGT 
CTAT GGTAGCT GAGCCAAACACGT AGGAT TTCCGT T T TAAGGTTCACAT GGAAAAGGTTATAGCT TTG 
CCTTGAGATTGACTCATTAAAATCAGAGACTGTAACAAAAAAAAAAAAAAAAAAAAAGGGCGGCCGCG 
ACTCTAGAGTCGACCTGCAGAAGCTTGGCCGCCATGGCCCAACTTGTTTATTGCAGCTTATAATG 
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FIGURE d 

MS FLIDS S IMI TSQILFFGFGWLFFMRQLFKDYE I RQYWQVI FSVT FAFSCTMFEL 1 I FE I LGV 
LNSSSRYFHWK^LCVILLILVFWPFYIGYFIVSNIRLLHKQRLLFSCLLWLTFMYFFWKLGDP 
FPILSPKHGILSIEQLISRVGVIGVTLMALLSGFGAVNCPYTYMSYFLRNVTDTDILALERRLLQ 
TMDMIISKKKRMAMARRTMFQKGEVHNKPSGFWGMIKSVTTSASGSENLTLIQQEVDALEELSRQ 
LFLETADLYATKERIEYSKTFKGKYFNFLGYFFSIYCVWKIFMATINIVFDRVGKTDPVTRGIEI 
TVNlfLGIQFDVKFWSQHISFILVGIIIVTSIRGLLITLTKFFYAISSSKSSWIVLLLAQIMGMY 
FVS S VLL IRMSMPLE YRT 1 1 TEVLGELQ FNFYHRW FDVI FLVSALSS ILFLYLAHKQAPEKQMAP 

Important features: 
Signal peptide: 

amino acids 1-23 

Potential transmembrane domains: 

amino acids 37-55, 31-102, 150-168, 288-311, 338-356, 375-398, 
425-444 

N-giycosylation sites . 

amino acids 67-70, 180-183 and 243-246 

Eukaryotic cobalamin-binding proteins 

amino acids 151-160 
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FIGURE s 



AGa^GGGAAATCCGGATGTCTCGGTTATGAAGTGGAGCAGTGAGTGTGAGCCTCAACATAGTTCC 
AGAAC T C T CCAT CC GGAC TAG T TAT T GAG CAT CTGCCTCT CATATCACC AGT GGCC AT C T GAG G T 
GTTTCCCTGGCTCTGAAGGGGTAGGCACG ATG GCCAGGTGCTTCAGCCTGGTGTTGCTTCTCACT 
TCCATCTGGACCACGAGGCTCCTGGTCCAAGGCTCTTTGCGTGCAGAAGAGCTTTCCATCCAGGT 
GTCATGCAGAATTATGGGGATCACCCTTGTGAGCAAAAAGGCGAACCAGCAGCTGAATTTCACAG 
AAGCTAAGGAGGCCTGTAGGCTGCTGGGACTAAGTTTGGCCGGCAAGGACCAAGTTGAAACAGCC 
TTGJy^AGCTAGCTTTGAAACTTGCAGCTATGGCTGGGTTGGAGATGGATTCGTGGTCATCTCTAG 
GATTAGCCCAAACCCCAAGTGTGGGAAAAATGGGGTGGGTGTCCTGATTTGGAAGGTTCCAGTGA 
GCCGACAGTTTGCAGCCTATTGTTACAACTCATCTGATACTTGGACTAACTCGTGCATTCCAGAA 
5 . ATTATCACCACCAAAGAT C CCAT AT TCAACAC TCAAA.CTGCAACACAAACAACAGAAT TTAT TGT 

? _ CAGTGACAGTACCTACTCGGTGGCATCCCCTTACTCTACAATACCTGCCCCTACTACTACTCCTC 
13 CTGCTCCAGCTTCCACTTCTATTCCACGGAGAAAAAAATTGATTTGTGTCACAGAAGTTTTTATG 



GAAACTAGCACCAT G T C TACAGAAAC T GAACCATT T G T TGAAAATAAAGCAGCAT TCAAGAATGA 



(ft AGCTGCTGGGTTTGGAGGTGTCCCCACGGCTCTGCTAGTGCTTGCTCTCCTCTTCTTTGGTGCTG 

CAGCTGGTCTTGGATTTTGCTATGTCAAAAGGTATGTGAAGGCCTTCCCTTTTACAAACAAGAAT 
CAGCAGAAGGAAAT GATCGAAAC CAAAGTAGTAAAGGAGGAGAAGGCCAAT GATAGCAACCCTAA 



Wl TGAGGAATCAAAGAAAACTGATAAAAACCCAGAAGAGTCCAAGAGTCCAAGCAAAACTACCGTGC 
ifl GAT GCC T GGAAGCT GAAGT TTAGAT GAGACAGAAAT GAGGAGACACACCTGAGGCTGG T T TCT T T 

CAT i3 C T C C TT AC CC T GC C C CAGC T G G GGAAAT CAAAAGGGC CAAAGAACCAAAGAAGAAAG T CCA 
CCCT TGGT TC CT AACTGGAATCAGCT CAGGAC T GCCATTGGAC TAT GG AG T GCACCAAAGAGAAT 
GCCCTTCTCCTTATTGTAACCCTGTCTGGATCCTATCCTCCTACCTCCAAAGCTTCCCACGGCCT 
TTCTAGCCTGGCTATGTCCTAATAATATCCCACTGGGAGAAAGGAGTTTTGCAAAGTGCAAGGAC 
CTA?y\ACATCTCATCAGTATCCAGTGGTAAAAAGGCCTCCTGGCTGTCTGAGGCTAGGTGGGTTG 
AAAGCCAAGGAGTCACTGAGACCAAGGCTTTCTCTACTGATTCCGCAGCTCAGACCCTTTCTTCA 
GCT CTGAAAGAGAAACACG TATCCCACCTGACATG T CCT T C TGAGCCCGGT AAGAGCAAAAGAAT 
l J* GGCAGAAAAGTTTAGCCCCTGAAAGCCATGGAGATTCTCATAACTTGAGACCTAATCTCTGTAAA 
W GCT^?\AAATAAAGAAATAGAACAAGGCTGAGGATACGACAGTACACTGTCAGCAGGGACTGTAAAC 
Hi ACAGACAGGG TCAAAGTGT T T TCTCT GAACACATT GAGT T GGAATCACTGT TTAGAACACACACA 

CTTACTTTTTCTGGTCTCTACCACTGCTGATATTTTCTCTAGGAAATATACTTTTACAAGTAACA 
AAAATAAAAACTCTTATAAATT TCTATTTTTATCTGAGTTACAGAAATGAT TACTAAGGAAGATT 
ACT CAGTAAT T T GT T T AAAAAGTAATAAAATT CAACAAACAT TT GCTGAATAGCTACTAT AT GTC 
AAGTGCTGTGCAAGGTAT TACACTCTGTAATTGAATATTATTCCTCAAAAAAT TGCACATAGTAG 
AACGCTATCTGGGAAGCTATTTTTTTCAGTTTTGATATTTCTAGCTTATCTACTTCCAAACTAAT 
TTTTATTTTTGCTGAGACTAATCTTATTCATTTTCTCTAATATGGCAACCATTATAACCTTAATT 
TAT TATTAACATAC CT AAGAAGTACATTGT TACC T CTATATACCAAAGCACAT T T TAAAAGT GC C 
ATT.AACAAATGTATCACTAGCCCTCCTT TTTCCAACAAGAAGGGACTGAGAGATGCAGAAATAT T 
TGT GACAAAAAAT T AAAGCAT T T AGAAAAC T T 
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FIGURE 6 

MARCFSLVLLLTSIWTTRLLVQGSLRAEELSIQVSCRIMGITLVSKKANQQLNFTEAKEACRLLG 
LSLAGKDQVETALKAS FE TCS YGWVGDG FWT SRI SPNPKCGKNGVGVL I WKVPVSRQFAAYCYN 
SSDTWTNSCIPEIITTKDPIFNTQTATQTTEFIVSDSTYSVASPYSTIPAPTTTPPAPASTSIPR 
RKKLICVTEVFMETSTMSTETEPFVENKAAFKNEAAGFGGVPTALLVLALLFFGAAAGLGFCYVK 
RYVICAFPFTNKNQQKEMIETKVVKEEKMfDSNPNEESKKTDKNPEESKSPSKTTVRCLEAEV 

Signal sequence : 

amino acids 1-16 

Transmembrane domain: 

amino acids 235-254 

N-glycosylation site, 

amino acids 53-57, 130-134, 289-293 

Casein kinase II phosphorylation site. 

amino acids 145-149, 214-218 

Tyrosine kinase phosphorylation site. 

amino acids 79-88 

N-myristoylation site. 

amino acids 23-29, 65-71, 234-240, 235-239, 249-255, 253-259 
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FIGURE 7 



CGCCGCGCTCCCGCACCCGCGGCCCGCCCACCGCGCCGCTCCCGCATCTGCACCCGCAGCCCGGC 
GGCCTCCCGGCGGGAGCGAGCAGATCCAGTCCGGCCCGCAGCGCAACTCGGTCCAGTCGGGGCGG 
CGGCTGCGGGCGCAGAGCGGAGATGCAGCGGCTTGGGGCCACCCTGCTGTGCCTGCTGCTGGCGG 
CGGCGGTCCCCACGGCCCCCGCGCCCGCTCCGACGGCGACCTCGGCTCCAGTCAAGCCCGGCCCG 
GCTCTCAGCTACCCGCAGGAGGAGGCCACCCTCAATGAGATGTTCCGCGAGGTTGAGGAACTGAT 
GGAGGACACGCAGCACAAAT TGCGCAGCGCGG TGGAAGAGAT GGAGGCAGAAGAAGC TGC TGCTA 
AAGCATCATCAGAAG T GAACC T GGCAAAC T TACCTCCCAGC TAT CACAATGAGACCAACACAGAC 
ACa^AGGTTGGA^ATAATACCATCCATGTGCACCGAGAAATTCACAAGATAACCAACAACCAGAC 
T GGACAAATGGT CT T T TCAGAGACAGT TATCACAT CT GTGGGAGACGAAGAAGGCAGAAGGAGCC 
ACGAGTGCATCATCGACGAGGACTGTGGGCCCAGCATGTACTGCCAGTTTGCCAGCTTCCAGTAC 
ACCTGCCAGCCATGCCGGGGCCAGAGGATGCTCTGCACCCGGGACAGTGAGTGCTGTGGAGACCA 
GCTGTGTGTCTGGGGTCACTGCACCAAAATGGCCACCAGGGGCAGCAATGGGACCATCTGTGACA 
ACCAGAGGGACTGCCAGCCGGGGCTGTGCTGTGCCTTCCAGAGAGGCCTGCTGTTCCCTGTGTGC 
||| ACACCCCTGCCCGTGGAGGGCGAGCTTTGCCATGACCCCGCCAGCCGGCTTCTGGACCTCATCAC 
Ifl CTGGGAGCTAGAGCCTGATGGAGCCTTGGACCGATGCCCTTGTGCCAGTGGCCTCCTCTGCCAGC 

in 



m 



C CCACAGCCACAGCC TGGTGTATG T G T GCAAGCCGACCT T CGTGGGGAGCCGTGACCAAGAT GGG 
GAGATCC TGC T GCCCAGAGAGGTCC CCGATGAGTAT GAAGT TGGCAGCTT CATGGAGGAGGT GCG 
CCAGGAGCTGGAGGACCTGGAGAGGAGCCTGACTGAAGAGATGGCGCTGGGGGAGCCTGCGGCTG 
CCGCCGCTGCACTGCTGGGAGGGGAAGAGATTTAGATCTGGACCAGGCTGTGGGTAGATGTGCAA 
TAGAAATAGCTAATTTATTTCCCCAGGTGTGTGCTTTAGGCGTGGGCTGACCAGGCTTCTTCCTA 
CATCTTCTTCCCAGTAAGTTTCCCCTCTGGCTTGACAGCATGAGGTGTTGTGCATTTGTTCAGCT 
Q : CCCCCAGGCTGTTCTCCAGGCTTCACAGTCTGGTGCTTGGGAGAGTCAGGCAGGGTTAAACTGCA 
f|| GGAGCAGTTTGCCACCCCTGTCCAGATTATTGGCTGCTTTGCCTCTACCAGTTGGCAGACAGCCG 
f| TTTGTTCTACATGGCTTTGATAATTGTTTGAGGGGAGGAGATGGAAACAATGTGGAGTCTCCCTC 
ps TGATTGGTTTTGGGGAAATGTGGAGAAGAGTGCCCTGCTTTGCAAACATCAACCTGGCAAAAATG 
iW CAACAAATGAATTTTCCACGCAGTTCTTTCCATGGGCATAGGTAAGCTGTGCCTTCAGCTGTTGC 

AGATGAAATGTTCTGTTCACCCTGCATTACATGTGTTTATTCATCCAGCAGTGTTGCTCAGCTCC 
TACCTCTGTGCCAGGGCAGCATTTTCATATCCAAGATCAATTCCCTCTCTCAGCACAGCCTGGGG 
AGGGGGTCATTGTTCTCCTCGTCCATCAGGGATCTCAGAGGCTCAGAGACTGCAAGCTGCTTGCC 
CAAGTCACACAGCT AGTGAAGACCAGAGGAGT T T CATCTGG TT GTGACT CTAAGC TCAG TGC TCT 
CTCCACTACCCCACACCAGCCTTGGTGCCACCAAAAGTGCTCCCCAAAAGGAAGGAGAATGGGAT 
TTTTCTTGAGGCATGCACATCTGGAATTAAGGTCAAACTAATTCTCACATCCCTCTAAAAGTAAA 
CTACTGTTAGGAACAGCAGTGTTCTCACAGTGTGGGGCAGCCGTCCTTCTAATGAAGACAATGAT 
ATTGACACTGTCCCTCTTTGGCAGTTGCATTAGTAACTTTGAAAGGTATATGACTGAGCGTAGCA 
TACAGGT TAACC TGCAGAAACAGTACTTAGGTAAT T GTAGGGCGAGGAT TATAAATGAAA.T T TGC 
AAAATCAC T TAGCAGC AAC TGAAGACAAT TAT CAACCACGTGGAGAAAATCAAACCGAGCAGGGC 
TGTGTGAAACATGGTTGTAATATGCGACTGCGAACACTGAACTCTACGCCACTCCACAAATGATG 
TTTTCAGGTGTCATGGACTGTTGCCACCATGTATTCATCCAGAGTTCTTAAAGTTTAftAGTTGCA 
CATGAT TG TATAAGCATGC T TT CT T T GAGT TT TAAAT TAT G TATAAACATAAGT TGCAT T TAGAA 
ATC.AAGCATAAATCACTTCAACTGCAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 8 



MQRL GA T L L C L LLAAAVPT AP AP AP TAT S AP VK P G PAL S YP QEE AT LNEM FRE VE E LME D T QHKL 
RSAVEEMEAEEAAAKAS SEVNLANLPPS YHNETNTDTKVGNNT IHVHRE IHKI TNNQTGQMVFSE 
TVITSVGDEEGRRSHECIIDEDCGPSMYCQFASFQYTCQPCRGQRMLCTRDSECCGDQLCWGHC 
TKM2VTRGSNGTICDNQRDCQPGLCCAFQRGLLFPVCTPLPVEGELCHDPASRLLDLITWELEPDG 
ALDRCPCASGLLCQPHSHSLVYVCKPTFVGSRDQDGEILLPREVPDEYEVGSFMEEVRQELEDLE 
RSLTEEMALGEPAAAAAALLGGEEI 

Signal sequence: 

amino acids 1-19 

N-glycosylation site. 

amino acids 96-100, 106-110, 121-125, 204-208 
Casein kinase II phosphorylation site. 

amino acids 46-50, 67-71, 98-102, 135-139, 206-210, 312-316, 
327-331 

N-myristoylation site. 

amino acids 202-208, 217-223 

Amidation site. 

amino acids 14 0-144 
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FIGURE Q 



CGGi ^CGCGTGGGCGGACGCGTGGGGGCTGTGAGiW^GTGCCAATAAATACATCATGCAACCCCAC 
GGCCCACCTTGTGAACTCCTCGTGCCCAGGGCTGATGTGCGTCTTCCAGGGCTACTCATCCAAAG 
GCCTAATCCAACGTTCTGTCTTCAATCTGCAAATCTATGGGGTCCTGGGGCTCTTCTGGACCCTT 
AACTGGGTACTGGCCCTGGGCCAATGCGTCCTCGCTGGAGCCTTTGCCTCCTTCTACTGGGCCTT 
CCAGAA.GCCCCAGGACATCCCTACCTTCCCCTTAATCTCTGCCTTCATCCGCACACTCCGTTACC 
ACACTGGGTCATTGGCATTTGGAGCCCTCATCCTGACCCTTGTGCAGATAGCCCGGGTCATCTTG 
U GAGTATAT TGAC CACAAGC TCAGAGGAGTGCAGAAC CCT GTAGCCCGCTGCATCATGTGC T GT T T 

M 

O C AAG TGCTGCCTCTGGTGTC T GGAAAAAT T T AT CAAG T T C C T AAACC G C AAT GC AT AC AT CAT G A 

C- S TCGCCATCTACGGGAAGAATTTCTGTGTCTCAGCCAAAAATGCGTTCATGCTACTCATGCGAAAC 

m 

ATTGTCAGGGTGGTCGTCCTGGACAAAGTCACAGACCTGCTGCTGTTCTTTGGGAAGCTGCTGGT 
GGTCGGAGGCGTGGGGGTCCTGTCCTTCTTTTTTTTCTCCGGTCGCATCCCGGGGCTGGGTAAAG 
ACTTTAAGAGCCCCCACCTCAACTATTACTGGCTGCCCATCATGACCTCCATCCTGGGGGCCTAT 
GTCATCGCCAGCGGCTTCTTCAGCGTTTTCGGCATGTGTGTGGACACGCTCTTCCTCTGCTTCCT 
GGAfVGACCTGGAGCGGAACAACGGCTCCCTGGACCGGCCCTACTACATGTCCAAGAGCCTTCTAA 
AGATTCTGGGCAAGAAGAACGAGGCGCCCCCGGACAACAAGAAGAGGAAGAAGTGACAGCTCCGG 
CCCTGATCCAGGACTGCACCCCACCCCCACCGTCCAGCCATCCAACCTCACTTCGCCTTACAGGT 



I 5 T: 



i - s 



fH CTCCATTTTGTGGTAAAAAAAGGTTTTAGGCCAGGCGCCGTGGCTCACGCCTGTAATCCAACACT 
CI T TGAGAGGCTGAGGCGGGCGGAT CACCTGAGTCAGGAGT T CGAGACCAGCCTGGCCAACAT GGT G 

iW AAACCTCCGTCTCTATTAAAAATACAAAAATTAGCCGAGAGTGGTGGCATGCACCTGTCATCCCA 

GCTACTCGGGAGGCTGAGGCAGGAGAATCGCTTGAACCCGGGAGGCAGAGGTTGCAGTGAGCCGA 

GATCGCGCCACTGCACTCCAACCTGGGTGACAGACTC 

AAGATTTTAT TAAAGATATT TTGTTAACTC 



AppJD=10063553 



Page 161 of 320 



FIGURE 10 

RTRGRTRGGCEKVPINTSCNPTAHLVNSSCPGLMCVFQGYSSKGLIQRSVFNLQIYGVLGLFWTL 
NWV31ALGQCVLAGAFASFYWAFHKPQDIPTFPLISAFIRTLRYHTGSLAFGALILTLVQIARVIL 
E Y I DHKLRGVQNPVARC IMCC FKCCLWCLEKFIKFLNRNAYIMIAI YGKNFCVSAKNAFMLLMRN 
IVR^AA^LDKVTDLLLFFGKLLVVGGVGVLSFFFFSGRIPGLGKDFKSPHLNYYWLPIMTSILGAY 
VIASGFFSVFGMCVDTLFLCFLEDLERNNGSLDRPYYMSKSLLKILGKKNEAPPDNKKRKK 



5S? 



in 



Q 

3 S 5 
W 5 

D 



Important features: 
Transmembrane domains: 

amino acids 57-80 {type II), 110-126, 215-231, 254-274 

N-glycosylation sites. 

amino acids 16-20, 27-31, 289-293 

Hypothetical YBR002c family proteins . 

amino acids 276-288 

Ammonium transporters proteins. 



p»% amino acids 204-231 

ru 

N-myri s toyla ti on si tes . 

amino acids 60-66, 78-84 



Amidation site. 

amino acids 306-310 
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FIGURE 11 



GCCCCGCGCCCGGCGCCGGGCGCCCGAAGCCGGGAGCCACCGCCATGGGGGCCTGCCTGGGAGCCTGC 

TCCCTGCTCAGCTGCGCGTCCTGCCTCTGCGGCTCTGCCCCCTGCATCCTGTGCAGCTGCTGCCCCGC 

CAGCCGCAACTCCACCGTGAGCCGCCTCATCTTCACGTTCTTCCTCTTCCTGGGGGTGCTGGTGTCCA 

TCATTATGCTGAGCCCGGGCGTGGAGAGTCAGCTCTACAAGCTGCCCTGGGTGTGTGAGGAGGGGGCC 

GGGATCCCCACCGTCCTGCAGGGCCACATCGACTGTGGCTCCCTGCTTGGCTACCGCGCTGTCTACCG 

CATGTGCTTCGCCACGGCGGCCTTCTTCTTCTTCTTTTTCACCCTGCTCATGCTCTGCGTGAGCAGCA 

GCCGGGACCCCCGGGCTGCCATCCAGAATGGGTTTTGGTTCTTTAAGTTCCTGATCCTGGTGGGCCTC 

ACCGTGGGTGCCTTCTACATCCCTGACGGCTCCTTCACCAACATCTGGTTCTACTTCGGCGTCGTGGG 

CTCCTTCCTCTTCATCCTCATCCAGCTGGTGCTGCTCATCGACTTTGCGCACTCCTGGAACCAGCGGT 

GGCTGGGCAAGGCCGAGGAGTGCGATTCCCGTGCCTGGTACGCAGGCCTCTTCTTCTTCACTCTCCTC 

TTCTACTTGCTGTCGATCGCGGCCGTGGCGCTGATGTTCATGTACTACACTGAGCCCAGCGGCTGCCA 

CGAGGGCAAGGTCTTCATCAGCCTCAACCTCACCTTCTGTGTCTGCGTGTCCATCGCTGCTGTCCTGC 

CCAAGGTCCAGGACGCCCAGCCCAACTCGGGTCTGCTGCAGGCCTCGGTCATCACCCTCTACACCATG 

TTTGTCACCTGGTCAGCCCTATCCAGTATCCCTGAACAGAAATGCAACCCCCATTTGCCAACCCAGCT 

GGG(I!AACGAGACAGTTGTGGCAGGCCCCGAGGGCTATGAGACCCAGTGGTGGGATGCCCCGAGCATTG 

TGGGCCTCATCATCTTCCTCCTGTGCACCCTCTTCATCAGTCTGCGCTCCTCAGACCACCGGCAGGTG 

AACAGCCTGATGCAGACCGAGGAGTGCCCACCTATGCTAGACGCCACACAGCAGCAGCAGCAGCAGGT 

GGCAGCCTGTGAGGGCCGGGCCTTTGACA?VCGAGCAGGACGGCGTCACCTACAGCTACTCCTTCTTCC 

ACTTCTGCCTGGTGCTGGCCTCACTGCACGTCATGATGACGCTCACCAACTGGTACAAGCCCGGTGAG 

ACCCGGAAGATGATCAGCACGTGGACCGCCGTGTGGGTGAAGATCTGTGCCAGCTGGGCAGGGCTGCT 

C CT C TAC CT GT GGAC C C TGG TAGC C CCAC T CC T CC T G CGCAAC CG C GAC T T CAGC TGAGGCAG C CTCA 

CAGCCTGCCATCTGGTGCCTCCTGCCACCTGGTGCCTCTCGGCTCGGTGACAGCCAACCTGCCCCCTC 

CCCACACCAATCAGCCAGGCTGAGCCCCCACCCCTGCCCCAGCTCCAGGACCTGCCCCTGAGCCGGGC 

CTTCTAGTCGTAGTGCCTTCAGGGTCCGAGGAGCATCAGGCTCCTGCAGAGCCCCATCCCCCCGCCAC 

ACCCACACGGTGGAGCTGCCTCTTCCTTCCCCTCCTCCCTGTTGCCCATACTCAGCATCTCGGATGAA 

AGGGCTCCCTTGTCCTCAGGCTCCACGGGAGCGGGGCTGCTGGAGAGAGCGGGGAACTCCCACCACAG 

TGGGGCATCCGGCACTGAAGCCCTGGTGTTCCTGGTCACGTCCCCCAGGGGACCCTGCCCCCTTCCTG 

GACT TCGT GCCTTACT GAGT CTCT AAGAC TTTT TCTAATAAACAAGCCAGT GCGT GTAAAAAAAA 
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FIGURE 12 



MGACLGACSLLSCASCLCGSAPC I LCSCCPASRNS TVSRLI FTFFLFLGVLVS I IMLS PGVESQL 
YKLPWVCEEGAGIPTVLQGHIDCGSLLGYRAVYRMCFATAAFFFFFFTLLMLCVSSSRDPRAAIQ 

NGFWFFKFL I LVGLTVGAFY I PDGS FTNI WFYFGWGS FLFIL IQLVLL I DFAHS WNQRWLGKAE 
ECDSRAWYAGLFFFTLLFYLLSIAAVALMFMYYTEPSGCHEGKVFISLNLTFCVCVSIAAVLPKV 
QDAQPNSGLLQASVI TLYTMFVTWSALS S I PEQKCNPHLPTQLGNETWAGPEGYETQWWDAPS I 
VGLI IFLLCTLFI SLRSSDHRQVNSLMQTEECPPMLDATQQQQQQVAACEGRAFDNEQDGVTYSY 
S FFIiFCLVLAS LHVMMT LTNW YKPGETRKMI S T WTAVWVKI CAS WAGLLL YLWTLVAPLLLRNRD 
FS 

Signal sequence: 

amino acids 1-20 

Transmembrane domains: 

amino acids 40-58, 101-116, 134-150, 162-178, 206-223, 240-257, 
272-283, 324-340, 391-406, 428-444 
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FIGURE 13 



CGGGCCAGCCTGGGGCGGCCGGCCAGGAACCACCCGTTAAGGTGTCTTCTCTTTAGGGATGGTGA 
GGTTGGAAAAAGACTCCTGTAACCCTCCTCCAGGATGAACCACCTGCCAGAAGACATGGAGAACG 
CTCTCACCGGGAGCCAGAGCTCCCATGCTTCTCTGCGCAATATCCATTCCATCAACCCCACACAA 
CTCATGGCCAGGATTGAGTCCTATGAAGGAAGGGAAAAGAAAGGCATATCTGATGTCAGGAGGAC 
TTTCTGT T TGTT TGTCACCT TTGACCT CT TAT TCGTAACATTACTGTGGATAATAGAGTTAAATG 
T GAATGGAGGCAT T GAGAACACAT T AGAGAAGGAGGTGAT GCAGTAT GACTACTAT T C TT CATAT 
T TT GATATAT XT CT TCTGGCAGTT T T T CGAT T TAAAGT G T TAATACTTGCATATGC I G TGT GCAG 
ACTGCGCCATTGGTGGGCAATAGCGTTGACAACGGCAGTGACCAGTGCCTTTTTACTAGCAAAAG 
TO TGATCCTTTCGAAGCTTTTCTCTCAAGGGGCTTTTGGCTATGTGCTGCCCATCATTTCATTCATC 
ffl CTT GCC TGGAT T GAGAC GTGGT TCCT GGAT T TCAAAGT GT TACCTCAAGAAGCAGAAGAAGAAAA 

|y CAGACTCC TGATAGT TCAGGATGC T TCAGAGAGGGCAGCACT TATACCT GGTGG TCT T TCTGAT G 

GTCAGT T T TATT CCCCTCCTGAATCCGAAGCAGGAT CT GAAGAAGCT GAAGAAAAACAGGACAGT 
GAGi : ^AAC C AC T T T T AGAAC T ATGAG T AC T AC T T T T GT T AAAT G T GAAAAAC CC T CACAGAAAG T C 
ATCGAGGCAAAAAGAGGCAGGCAGTGGAGTCTCCCT GTCGACAGTAAAGTT GAAATGGTGACGT C 

a? 

f% CACT GC TGGCT TTAT TGAACAGCTAATAAAGAT TTAT T TAT T G TAATAC CT CACAAACGT TGTAC 



111 CATATCCATGCACATTTAGTTGCCTGCCTGTGGCTGGTAAGGTAATGTCATGATTCATCCTCTCT 
M T CAGTGAGAC TGAGCCTGATG TGT TAACAAATAGGT GAAGAAAGTC T TGTGCTG TAT TCCTAAT C 



AAAi=LGACTTAATATATTGAAGTAACACTTTTTTAGTAAGCAAGATACCTTTTTATTTCAATTCAC 
AGAJ\TGGAATTTTTTTGTTTCATGTCTCAGATTTATTTTGTATTTCTTTTTTAACACTCTACATT 
TCCCTTGTTTTTTAACTCATGCACATGTGCTCTTTGTACAGTTTTAAAAAGTGTAATAAAATCTG 
ACATGTCAATGTGGCTAGTTTTATTTTTCTTGTTTTGCATTATGTGTATGGCCTGAAGTGTTGGA 
CTT GCAAAAGGGGAAGAAAGGAAT T GCGAATACATGTAAAATGTCACCAGACAT T TGTAT TAT T T 
TTATCATGAAATCATGTTTTTCTCTGATTGTTCTGAAATGTTCTAAATACTCTTATTTTGAATGC 
ACAAAATGAC T TAAACCAT TCAT AT CATGTT T CCT T TGCGTTCAGCCAAT T TCAAT TAAAATGAA 
CTAAATTAAAAA 
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FIGURE 14 

MNHLPEDMENALTGSQSSHASLRNIHS 

VTLLWI IELNVNGGI ENTLEKEVMQYDYYS S YFD I FLLAVFRFKVL I LAYAVCRLRHWWAI ALT T 

AVTSAFLIJ\KVILSKLFSQGAFGYVLP 

AAL I PGGL S DGQFYS PPESEAGSEEAEEKQDSEKPLLEL 



ri 



Important features of the protein: 
Signal peptide: 
amino acids 1-20 



ffl Transmembrane domains: 



amino acids 54-72, 100-118, 130-144, 146-166 
N-myristoylation sites. 

amino acids 14-20, 78-84, 79-85, 202-208, 217-223 
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FIGURE 15 



ACTCGAACGCAGTTGCTTCGGGACCCAGGACCCCCTCGGGCCCGACCCGCCAGGAAAGACTGAGG 

CCGCGGCCTGCCCCGCCCGGCTCCCTGCGCCGCCGCCGCCTCCCGGGACAGAAGATGTGCTCCAG 

GGTCCCTCTGCTGCTGCCGCTGCTCCTGCTACTGGCCCTGGGGCCTGGGGTGCAGGGCTGCCCAT 

CCGGCTGCCAGTGCAGCCAGCCACAGACAGTCTTCTGCACTGCCCGCCAGGGGACCACGGTGCCC 

CGAGACGTGCCACCCGACACGGTGGGGCTGTACGTCTTTGAGAACGGCATCACCATGCTCGACGC 

AGGCAGCTTTGCCGGCCTGCCGGGCCTGCAGCTCCTGGACCTGTCACAGAACCAGATCGCCAGCC 

TGCCCAGCGGGGTCTTCCAGCCACTCGCCAACCTCAGCAACCTGGACCTGACGGCCAACAGGCTG 

CATGAAATCACCAATGAGACCTTCCGTGGCCTGCGGCGCCTCGAGCGCCTCTACCTGGGCAAGAA 

CCG(^ATCCGCCACATCCAGCCTGGTGCCTTCGACACGCTCGACCGCCTCCTGGAGCTCAAGCTGC 

AGGAOVACGAGCTGCGGGCAGTGCCCCCGCTGCGCCTGCCCCGCCTGCTGCTGCTGGACCTCAGC 

CACAACAGCCTCCTGGCCCTGGAGCCCGGCATCCTGGACACTGCCAACGTGGAGGCGCTGCGGCT 

GGCTGGTCTGGGGCTGCAGCAGCTGGACGAGGGGCTCTTCAGCCGCTTGCGCAACCTCCACGACC 

TGGATGTGTCCGACAACCAGCTGGAGCGAGTGCCACCTGTGATCCGAGGCCTCCGGGGCCTGACG 

CGCCTGCGGCTGGCCGGCAACACCCGCATTGCCCAGCTGCGGCCCGAGGACCTGGCCGGCCTGGC 

TGCCCTGCAGGAGCTGGATGTGAGCAACCTAAGCCTGCAGGCCCTGCCTGGCGACCTCTCGGGCC 

TCTTCCCCCGCCTGCGGCTGCTGGCAGCTGCCCGCAACCCCTTCAACTGCGTGTGCCCCCTGAGC 

TGGTTTGGCCCCTGGGTGCGCGAGAGCGACGTCACACTGGCCAGCCCTGAGGAGACGCGCTGCCA 

CTTCCCGCCCAAGAACGCTGGCCGGCTGCTCCTGGAGCTTGACTACGCCGACTTTGGCTGCCCAG 

CCACCACCACCACAGCCACAGTGCCCACCACGAGGCCCGTGGTGCGGGAGCCCACAGCCTTGTCT 

TCTAGCTTGGCTCCTACCTGGCTTAGCCCCACAGCGCCGGCCACTGAGGCCCCCAGCCCGCCCTC 

CACTGCCCCACCGACTGTAGGGCCTGTCCCCCAGCCCCAGGACTGCCCACCGTCCACCTGCCTCA 

ATGGGGGCACATGCCACCTGGGGACACGGCACCACCTGGCGTGCTTGTGCCCCGAAGGCTTCACG 

GGC CTGTACT G T GAGAGCCAGAT GGGGCAGGGGACACGGC CCAGCCC TACACCAGTCACGCC GAG 

GCCACCACGGTCCCTGACCCTGGGCATCGAGCCGGTGAGCCCCACCTCCCTGCGCGTGGGGCTGC 

AGCGCTACCTCCAGGGGAGCTCCGTGCAGCTCAGGAGCCTCCGTCTCACCTATCGCAACCTATCG 

GGCCCTGATAAGCGGCTGGTGACGCTGCGACTGCCTGCCTCGCTCGCTGAGTACACGGTCACCCA 

GCTGCGGCCCAACGCCACTTACTCCGTCTGTGTCATGCCTTTGGGGCCCGGGCGGGTGCCGGAGG 

GCGAGGAGGCCTGCGGGGAGGCCCATACACCCCCAGCCGTCCACTCCAACCACGCCCCAGTCACC 

CAGGCCCGCGAGGGCAACCTGCCGCTCCTCATTGCGCCCGCCCTGGCCGCGGTGCTCCTGGCCGC 

GCTGGCTGCGGTGGGGGCAGCCTACTGTGTGCGGCGGGGGCGGGCCATGGCAGCAGCGGCTCAGG 

ACAAAGGGCAGGTGGGGCCAGGGGCTGGGCCCCTGGAACTGGAGGGAGTGAAGGTCCCCTTGGAG 

CCAGGCCCGAAGGCAACAGAGGGCGGTGGAGAGGCCCTGCCCAGCGGGTCTGAGTGTGAGGTGCC 

ACTCATGGGCTTCCCAGGGCCTGGCCTCCAGTCACCCCTCCACGCAAAGCCCTACATCTAAGCCA 

GAGAGAGACAGGGCAGCTGGGGCCGGGCTCTCAGCCAGTGAGATGGCCAGCCCCCTCCTGCTGCC 

ACACCACGTAAGTTCTCAGTCCCAACCTCGGGGATGTGTGCAGACAGGGCTGTGTGACCACAGCT 

GGGCCCTGTTCCCTCTGGACCTCGGTCTCCTCATCTGTGAGATGCTGTGGCCCAGCTGACGAGCC 

CTAACGTCCCCAGAACCGAGTGCCTATGAGGACAGTGTCCGCCCTGCCCTCCGCAACGTGCAGTC 

CCTGGGCACGGCGGGCCCTGCCATGTGCTGGTAACGCATGCCTGGGTCCTGCTGGGCTCTCCCAC 

TCCAGGCGGACCCTGGGGGCCAGTGAAGGAAGCTCCCGGAAAGAGCAGAGGGAGAGCGGGTAGGC 

GGCTGTGTGACTCTAGTCTTGGCCCCAGGAAGCGAAGGAACAAAAGAAACTGGAAAGGAAGATGC 

T T T AGG AACAT GT TTTGCTTTTT T AAAATATATATAT T TATAAGAGAT C C T T T C CCAT T TAT TC T 

GGGAAGATGTT TTTCAAACTCAGAGACAAGGACTTT GGTT TTTGTAAGACAAACGATGATAT GAA 

GGCCT T T TGTAAGAAAAAATAAAAGATGAAGTGTGAAA 
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FIGURE 16 



MCSRVPLLLPLLLLLALGPGVQGCPSGCQCSQPQTVFCTARQGTTVPRDVPPDTVGLYVFENGIT 
MLDAGSFAGLPGLQLLDLSQNQIASLPSGVFQPLANLSNLDLTANRLHE I TNETFRGLRRLERLY 
LGKNR I RH IQPGAFDTLDRLLELKLQDNELRALP PLRLPRLLLLDLSHNSLLALEPG I LDTANVE 
ALRiLAGLGLQQLDEGLFSRLRKLHDLDVSDNQLERVPPVIRGLRGLTRLRLAGNTRIAQLRPEDL 
AGKQJUiQELDVSNLSLQALPGDLSGLFPRLRLLAAARNPFNCVCPLSWFGPWVRESHVTLASPEE 
T RCHFP PKNAGRLLLEL DYADFGC PATT T TAT VP T TRPWRE P TAL S S S LAPT WL S P TAPATEAP 
SPPSTAPPTVGPVPQPQDCPPSTCLNGGTCHLGTRHHLACLCPEGFTGLYCESQMGQGTRPSPTP 
VTPRPPRSLTLGIEPVSPTSLRVGLQRYLQGSSVQLRSLRLTYRNLSGPDKRLVTLRLPASLAEY 
T VTQLRPNAT YSVCVMPLG PGRVPE GEEACGEAH T PPAVHSNHAP VT QARE GNLPLL I APALAAV 
LIAALAAVGAAYCVRRGRAMAAAAQDKGQVGPGAGPLELEGVKVPLEPGPKATEGGGEALPSGSE 
CEVPLMGFPGPGLQSPLHAKPYI 

Important features: 
Signal peptide: 

amino acids 1-23 
Transmembrane domain: 

amino acids 579-599 

EGF-like domain cysteine pattern signature. 

amino acids 430-442 
Leucine zipper pattern, 
amino acids 197-219, 269-291 
N-glycosylation sites • 

amino acids 101-105, 117-121, 273-277, 500-504, 528-532 
Tyrosine kinase phosphorylation sites, 
amino acids 124-131, 337-345 
N-myristoylation sites. 

amino acids 23-29, 27-33, 70-76, 142-148, 187-193, 348-354, 
594-600, 640-646 
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FIGURE 17 



GCAGCGGCGAGGCGGCGGTGGTGGCTGAGTCCGTGGTGGCAGAGGCGAAGGCGACAGCTCATGCG 
GGTCCGGATAGGGCTGACGCTGCTGCTGTGTGCGGTGCTGCTGAGCTTGGCCTCGGCGTCCTCGG 
ATGAAGAAGGCAGCCAGGATGAATCC T TAGAT TCCAAGAC TAC T T TGACAT CAGATGAGTCAGTA 
AAGGACCATACTACTGCAGGCAGAGTAGT TGC TGGTCAAATATT TCTTGATTCAGAAGAATCTGA 
ATTAGAATCCTCTATTCAAGAAGAGGAAGACAGCCTCAAGAGCCAAGAGGGGGAAAGTGTCACAG 
AAGATATCAGCT T T CT AGAGTCT CCAAATCCAGAAAACAAGGACTAT GAAGAGCCAAAGAAAGT A 
CGG.AAACCAGCTTTGACCGCCATTGAAGGCACAGCACATGGGGAGCCCTGCCACTTCCCTTTTCT 
TTTCCTAGATAAGGAGTAT GAT GAATGTACATCAGAT GGGAGGGAAGAT GGCAGACTGTGG T GT G 
CTACAACC TAT GAC T ACAAAGCAGAT GAAAAGTGGGGCT T T TGTGAAACTGAAGAAGAGGCT GCT 
AAGAGACGGCAGATGCAGGAAGCAGAAATGATGTATCAAACTGGAATGAAAATCCTTAATGGAAG 
CAATAAGAAAAGCCAAAAAAGAGAAGCATATCGGTATCTCCAAAAGGCAGCAAGCATGAACCATA 
CCAAAGCCCTGGAGAGAGTGTCATATGCTCTTTTATTTGGTGATTACTTGCCACAGAATATCCAG 
GCAGCGAGAGAGAT G T TTGAGAAGC TGAC TGAGGAAGGCT CTCCCAAGGGACAGAC T GC TCT TGG 
CTTTCTGTATGCCTCTGGACTTGGTGTTAATTCAAGTCAGGCAAAGGCTCTTGTATATTATACAT 
TTGGAGCTCTTGGGGGCAATCTAATAGCCCACATGGTTTTGGTAAGTAGACTTTAGTGGAAGGCT 
AAT AATAT TAACAT CAGAAGAAT TT G T GGT T TATAGCGGCCACAACT T TT T CAGCT T TCATGAT C 
CAGATTTGCTTGTATTAAGACCAAATATTCAGTTGAACTTCCTTCAAATTCTTGTTAATGGATAT 
AACACATGGAAT CTACATG TAAATGAAAGT TGGT GGAGTCCACAATT T TT CT TTAAAATGAT TAG 
TTTGGCTGATTGCCCCTAAAAAGAGAGATCTGATAAATGGCTCTTTTTAAATTTTCTCTGAGTTG 
GAAT TGTCAGAATCATT TT T TACAT TAGAT TATCAT AAT T T TAAAAAT T T T TC T T TAGT T T T TCA 
AAAT T T T GTAAAT GG T GGC TATAGAAAAACAACATGAAATATTATACAATAT T T TGCAACAATGC 
CCT AAGAAT TG T TAAAAT TCATGGAG T TAT TT GT GCAGAATGACT CCAGAGAGC TC TAC TT T CT G 
TTTTTTACTTTTCATGATTGGCTGTCTTCCCATTTATTCTGGTCATTTATTGCTAGTGACACTGT 
GCCTGCTTCCAGTAGTCTCATTTTCCCTATTTTGCTAATTTGTTACTTTTTCTTTGCTAATTTGG 
AAG AT T AAC T CAT T T T T AATAAAAT TAT GT C T AAGAT T AAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAA&AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 18 



MRVRIGLTLLLCAVLLSLASASSDEEGSQDESLDSKTTLTSDESVKDHTTAGRWAGQIFLDSEESEL 
ESSIQEEEDSLKSQEGESVTEDISFLESPNPENKDYEEPKKVRKPALTAIEGTAHGEPCHFPFLFLDK 
EYDECTSDGREDGRLWCATTYDYKADEKWGFCETEEEAAKRRQ 

EAYRYLQKAASMNHTKALERVSYALLFGDYLPQNIQAAREMFEKLTEEGSPKGQTALGFLYASGLGVN 
SSQAKALVYYTFGALGGNLIAHMVLVSRL 

Important features: 
Signal peptide: 

amino acids 1-21 

N-glycosylation sites. 

amino acids 195-199, 217-221, 272-276 

Tyrosine kinase phosphorylation site. 

amino acids 220-228 

N-myri s toylation si tes . 

amino acids 120-126, 253-259, 268-274, 270-274, 285-291, 289-295 

Glycosaminoglycan attachment site. 

amino acids 267-271 

Microbodies C-terminal targeting signal. 

amino acids 299-303 

Type II fibronectin collagen-binding domain protein. 

amino acids 127-169 

Fructose-bisphosphate aldolase class-II protein. 

amino acids 101-119 
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FIGURE iq 



AAT TCAGAT T T T AAGCCCAT TCT GCAGTGGAAT T TCAT GAACTAGCAAGAGGACACCATC T T CT T 
GTAT TATACAAGAAAGGAGTG TACC TATCACACACAGGGGGAAAAA.TGCTCT T T TGGGTGC TAGG 
CCTCCTAATCCTCTGTGGTTTTCTGTGGACTCGTAAAGGAAAACTAAAGATTGAAGACATCACTG 
ATA^IGTACATTTTTATCACTGGATGTGACTCGGGCTTTGGAAACTTGGCAGCCAGAACTTTTGAT 
AAA?VAGGGATTTCATGTAATCGCTGCCTGTCTGACTGAATCAGGATCAACAGCTTTAAAGGCAGA 
AACCTCAGAGAGACT TCGTACTGTGCTTCTGGATGT GACCGACCCAGAGAATGTCAAGAGGACT G 
CCCAGTGGGTGAAGAACCAAGTTGGGGAGAAAGGTCTCTGGGGTCTGATCAATAATGCTGGTGTT 
CCC GGCGTGC T GGCTCCCACTGACT GGCT GACACTAGAGGACTACAGAGAACCTAT T GAAGTGAA 
CCTGTTTGGACTCATCAGTGTGACACTAAATATGCTTCCTTTGGTCAAGAAAGCTCAAGGGAGAG 
WJ- TTATTAATGTCTCCAGTGTTGGAGGTCGCCTTGCAATCGTTGGAGGGGGCTATACTCCATCCAAA 

TAT GCAGTGGAAGGT TT CAAT GACAGCT TAAGACGGGACATGAAAGCT T T T GGT GTGCACGT CT C 
5*1 ATGCAT TGAACCAGGATTGTTCAAAACAAACT TGGCAGATCCAGTAAAGGTAAT TGAAAAAAAAC 

TCGCCATTTGGGAGCAGCTGTCTCCAGACATCAAACAACAATATGGAGAAGGTTACATTGAAAAA 
AGT CTAGACAAACT GAAAGGCAATAAATCC TAT GT GAACATGGACCT CT CT CCGGTGGTAGAGT G 



O 



U CATGGACCACGCTCTAACAAGTCTCTTCCCTAAGACTCATTATGCCGCTGGAAAAGATGCCAAAA 
:L! T TT TC TGGATACCT CTGTCTCACAT GCCAGCAGCT T TGCAAGACT T TT TAT TGTT GAAAC AGAAA 

y 

fl$ GCA(3AGCTGGCTAATCCCAAGGCAGTGTGACTCAGCTAACCACAAATGTCTCCTCCAGGCTATGA 
Q AATTGGCCGATTTCAAGAACACATCTCCTTTTCAACCCCATTCCTTATCTGCTCCAACCTGGACT 

355 B 

111 CATTTAGATCGTGCTTATTTGGATTGCAAAAGGGAGTCCCACCATCGCTGGTGGTATCCCAGGGT 

CCCTGCTCAAGTTTTCTTTGAAAAGGAGGGCTGGAATGGTACATCACATAGGCAAGTCCTGCCCT 
GTAT T TAGGC T T TGCCTGC TTGGTGTGAT GTAAGGGAAAT TGAAAGACT TGCCCATT CAAAATGA 
TCTTTACCGTGGCCTGCCCCATGCTTATGGTCCCCAGCATTTACAGTAACTTGTGAATGTTAAGT 
ATCATC TC TTAT CTAAATATTAAAAGATAAGTCAACCCAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAkAAAAAAAAA 



App_ID=10063553 



Page 171 of 320 



FIGURE 2Q 



MLFl/^VLGLLILCGFLWTRKGKLKIEDITDKYIFITGCDSGFGISrLAARTFDKKGFHVIAACLTESG 
STALKAETSERLRTVLLDVTDPENVKRTAQWVKMQVGEKGLWGLINMAGVPGVLAPTDWLTLEDY 
REPIEVNLFGLISVTLNMLPLVKKAQGRVINVS SVGGRLAIVGGGYTPSKYAVEGFNDSLRRDMK 
AFGVHVSCIEPGLFKTNLADPVKVIEKK^^ 

LSPWECMDHALTSLFPKTHYAAGKDAKIFWIPLSHMPAALQDFLLLKQKAELANPKAV 

Important features of the protein: 

*™ Signal peptide: 

Si 

amino acids 1-17 

m 

yj Transmembrane domain: 

amino acids 136-152 

W 

W 

N-glycosylation sites. 

n amino acids 161-163, 187-190 and 253-256 

y? 

^ Glycosaminoglycan attachment site. 

lZ amino acids 39-42 

fit 

N-myristoylation sites. 

amino acids 36-41, 42-47, 108-113, 166-171, 198-203 and 207-212 
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FIGURE 21 

CTGAGGCGGCGGTAGCATGGAGGGGGAGAGTACGTCGGCGGTGCTCTCGGGCTTTGTGCTCGGCG 
CACTCGCTTTCCAGCACCTCAACACGGACTCGGACACGGAAGGTTTTCTTCTTGGGGAAGTAAAA 
GGTGAAGCCAAGAACAGCATTACTGATTCCCAAATGGATGATGTTGAAGTTGTTTATACAATTGA 
CAT T CAGAAATATAT T CCATGCTAT CAGCT TT T TAGCT T T TATAAT T CT TCAGGCGAAG TAAAT G 
AGC AAGCACTGAAGAAAA/FAT TATCAAATGTCAAAAAGAAT GT GGTAGGT T GGTACAAAT TCCGT 
C G T CAT T C AGAT C AGAT CAT GAC G T T T AGAGAGAGGC T GCT T CAC AAAAAC T T GCAGGAGCAT T T 
T TCAAACCAAGACCT TGT T TT T CTGCT AT TAACACCAAGTATAATAACAGAAAGCTGC TCTACTC 
ATCGACTGGAACAT TCC TTATAT AAACCT CAAAAAGGACT T T TTCACAGGGTACCTT TAGT GG T T 
GCCAATCTGGGCATGTCTGAACAACTGGGTTATAAAACTGTATCAGGTTCCTGTATGTCCACTGG 
TT TTAGCCGAGCAGTACAAACACACAGCTC TAAATTTT TT GAAGAAGATGGATCCTTAAAGGAGG 
TACATAAGATAAAT GAAA.TGTATGCT TCATTACAAGAGGAATTAAAGAGTATATGCAAMAAGT G 
GAAGACAGTGAACAAGCAGTAGATAAACTAGTAAAGGATGTAAACAGAT TAAAACGAGAAAT TGA 
GAAAAGGAGAGGAGCACAGATTCAGGCAGCAAGAGAGAAGAACATCCAAAAAGACCCTCAGGAGA 
ACAT TTTTCTTTGT CAGGCAT TACGGACCT TT T T TCCAAAT TCTGAAT T T CT TCAT TCATGT GT T 
ATGTCTTTAAAAAATAGACATGTTTCTAAAAGTAGCTGTAACTACAACCACCATCTCGATGTAGT 
AGACAATC TGACCTTAATGGTAGAACACAC TGACATTCCT GAAGCTAGTCCAGC TAGTACACCAC 
AAAT CATT AAGCAT AAAGCC TTAGACT TAGATGACAGATGGCAAT TCAAGAGAT CTCGGTT GT TA 
GAT ACACAAGACAAACGAT CTAAAGCAAATAC TGGT AGTAGTAAC CAAGATAAAGCAT CCAAAAT 
GAGCAGCCCAGAAACAGATGAAGAAATTGAAAAGATGAAGGGTTT TGGTGAATATTCACGGT CTC 
CTACATT T TGA T CCT T TTAACCT TACAAGGAGAT ? T T T T TAT T T GGCT GAT GGG TAAAGCCAAAC 
ATTTCTATTGTTTTTACTATGTTGAGCTACTTGCAGTAAGTTCATTTGTTTTTACTATGTTCACC 
TGI' T TGCAGTAATACACAGATAACTCT TAGT GCAT T TACTT CACAAAG T ACT T T T TCAAACAT CA 
GAT GC T T T TAT T TCCAAAC CTT TT T T T CACCTT TCAC TAAG T TGT TGAGGGGAAGGCT TACACAG 
ACACAT TC T T T AGAAT TGGAAAAGTGAGACCAGGCACAGT GGC T CACAC CT GTAATCCCAGCACT 
TAGGGAAGACAAGTCAGGAGGAT T GAT TGAAGCTAGGAGT TAGAGAC CAGC CTGGGCAAC GT ATT 
GAGJACCATGTCTATTAAAAAATAAAAT GGAAAAGCAAGAATAGCCTT AT T T TCAAAATAT GGAAA 
GA? AT T TA TAT GAAAAT T TAT C T GAG TCAT TAAAATTC TCCTTAAGT GAT AC T T T T T T AGAAG T A 
CATTATGGCT AGAGT T GCCAGAT AAAA.TGC TGGATATCAT GCAATAAAT T T GCAAAACAT CATCT 
AAAATTTAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 22 



MEGES T S AVL S GFVLGALAFQHLNT DS DTEGFLLGEVKGEAKNS I TDSQMDDVE WYT I DI QKY I 
PCYQLFS FYNS SGEVNEQALKKI LSNVKKNWGWYKFRRHSDQIMT FRERLLHKNLQEH FSNQDL 
VFLLLTPSIITESCSTHRLEHSLYKPQKGLFHRVPLVVANLQ1SEQLGYKTVSGSCMSTGFSRAV 
QTHSSKFFEEDGSLKEVHKINEMYASLQEELKSICKKVEDSEQAVDKLVKDVNRLKREIEKRRGA 
QIQj^EKNIQKDPQENIFLCQALRTFFPNSEFLHSCVMSLKNRHVSKSSCNYNHHLDWDNLTL 
MVEHTDIPEASPASTPQIIKHKALDLDDRWQFKRSRLLDTQDPCRSKANTGSSNQDKASKMSSPET 
DEEIEKMKGFGEYSRSPTF 

Important features: 
Signal peptide: 

amino acids 1-19 

N-glycosylation sites. 

amino acids 75-79, 322-326 

N-myristoylation site. 

amino acids 184-154 

Growth factor and cytokines receptors family. 

amino acids 134-150 
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FIGURE 23 



GGCA.CAGCCGCGCGGCGGAGGGCAGAGTCAGCCGAGCCGAGTCCAGCCGGACGAGCGGACCAGCGCAGGGCAGCCCAA 
GCAGCGCGCAGCGAACGCCCGCCGCCGCCCACACCCTCTGCGGTCCCCGCGGCGCCTGCCACCCTTCCCTCCTTCCCC 
GCGTCCCCGCCTCGCCGGCCAGTCAGCTTGCCGGGTTCGCTGCCCCGCGAAACCCCGAGGTCACCAGCCCGCGCCTCT 
GCTTCCCTGGGCCGCGCGCCGCCTCCACGCCCTCCTTCTCCCCTGGCCCGGCGCCTGGCACCGGGGACCGTTGCCTGA 
CGCGAGGCCCAGCTCTACTTTTCGCCCCGCGTCTCCTCCGCCTGCTCGCCTCTTCCACCAACTCCAACTCCTTCTCCC 
TCCA.GCTCCACTCGCTAGTCCCCGACTCCGCCAGCCCTCGGCCCGCTGCCGTAGCGCCGCTTCCCGTCCGGTCCCAAA 
GSTGGGAACGCGTCCGCCCCGGCCCGCACCRTGGCACGGTT CGGCTTGCCCGCGCTT CTCTGCACCCT GGCAGTGCTC 
AGCGCCGCGCTGCTGGCTGCCGAGCTCAAGTCGAAAAGTTGCTCGGAAGTGCGACGTCTTTACGTGTCCAAAGGCTTC 
hh AACAAGAACGAT GCC CCCCTCCACGAGATCAAC GGT GATCATTT GAAGAT CTGTC CC CAGGGTT CTAC CT GCTGCT CT 

C AAGAGAT GGAGGAGAAGT ACAGCC T GCAAAGT AAAGAT GATT T CAAAAGT GT GGT C AG C GAAC AGT GC AAT CAT T T G 
Q CAAGCTGT CTTT GCTT CACGTTACAAGAAGTTT GAT GAATT CT TCAAAGAACTACTT GAAAAT GCAGAGAAAT CCCTG 

01 AATGATATGTTT GTGAAGACAT ATGGCCAT TTAT ACATGCAAAATT CT GAGCTAT TT AAAGAT CT CTT CGTAGAGTTG 

Uj AAACGTTACTACGTGGTGGGAAATGT GAACCTGGAAGAAATGCTAAAT GACTTCTGGGCTCGCCT CCTGGAGCGGATG 

Ifl TTCCGCCTGGTGAACTCCCAGTACCACTTTACAGATGAGTATCTGGAATGTGTGAGCAAGTATACGGAGCAGCTGAAG 
I?! CCCTTCGGAGATGTCCCTCGCAAATTGAAGCTCCAGGTTACTCGTGCTTTTGTAGCAGCCCGTACTTTCGCTCAAGGC 
Ul TTAGCGGTTGCGGGAGATGTCGTGAGCAAGGTCTCCGTGGTAAACCCCACAGCCCAGTGTACCCATGCCCTGTTGAAG 



AT GAT CTACTGCT CCCACT GCCGGGGTCTCGTGACT GTGAAGCCATGT TACAACT ACT GCT CAAACAT CAT GAGAGGC 
T GTTT GGC CAAC CAAGGGGATCT CGATTTT GAATGGAACAATTTCATAGAT GCTATGCTGAT GGT GGCAGAGAGGCTA 
U I GAGGGTCCTTTCAACATTGAATCGGTC^TGGATCCCATCGATGTGAAGATTTCTGATGCTATTATGAACAT 

AATAGT GTT CAAGTGT CT CAGAAGGT TTT C CAGGGAT GTGGAC C CCC CAAGCCC CTCCCAGCTGGACGAATTTCT CGT 
T C CAT CTCT GAAAGT GCCTTCAGTGCTCGCTTCAGACCACATCACC CC GAGGAACGCCCAACCACAGCAGCT G GCACT 
AGTTT GGACCGACTGGTTACTGATGT CAAGGAGAAACT GAAACAGG CC AAGAAAT TCT GGT CCT C CCTT CC GAGCAAC 
GTT TGCAAC GAT GAGAGGATGGCTGC AGGAAACGGCAAT GAGGATGACTGT TGGAATGGGAAAGGCAAAAGCAGGT AC 
CT GTTTGCAGTGACAGGAAAT GGATTAGCCAACCAGGGCAACAACC CAGAGGT CCAGGTT GACACCAGCAAACCAGAC 
ATACT GAT CCTTCGT CAAATCAT GGCTCT T CGAGT GATGACCAGCAAGAT GAAGAAT GCATACAATGGGAACGACGTG 
GACTT CT TT GAT AT CAGT GAT GAAAGTAGT G GAGAAGGAAGTGGAAGT G GCTGT GAGTATCAGCAGTGCCCTT CAGAG 
TTT (5ACTACAAT GC CACT GACCAT GCTGGGAAGAGT GCCAATGAGAAAGC C GACAGT GCT GGT GT C CGT CCTGGGGCA 
CAGGCCTAC CT CCTCACTGT CTT CTGCAT CT TGT T CCTGGT TAT GCAGAGAGAGT GGAGATAAT T CTCAAACTCT GAG 
AAAAAGTGTT CAT CAAAAAGTTAAAAGGCACCAGTTATCACTTTT CT AC CATCCTAGT GACTTT GCTTTTTAAATGAA 
T GGACAACAAT GTACAGTTT TTACTATGT GGCC ACT GGTTT AAGAAGT GCT GACT T T GT TTTCT CATT CAGTT TT GGG 
AGGiW^GGGACTGTGCATTGAGTTGGTTCCTGCTCCCCC?WVCCATGTTAAACGTGGCTAACAGTGTAGGTACAGAA 
CTATAGTTAGTTGTGCATTTGTGATTTTATCACTCTATTATTTGTTTGTATGTTTTTTTCTCATTTCGTTTGTGGGTT 
TTTTTTTCCAACTGTGATCTCGCCTTGTTTCTTACAAGCAAACCAGGGTCCCTTCTTGGCACGTAACATGTACGTATT 
T CT (3AAATAT TAAATAGCT GTACAGAAGCAGGT T TT ATTT ATCATGTT AT CTTAT TAAAAGAAAAAGCCCAAAAAGC 
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FIGURE 2d 

MARFGLPALLCTLAVLSAALLAAELKSKSCSEVRRLYVSKGFNKNDAPLHEINGDHLKICPQGST 
CCSQEMEEKYSLQSKDDFKSWSEQCNHLQAVFASRYKKFDEFFKELLENAEKSLNDMFVKTYGH 
LYMQNSELFKDLFVELKRYYWGNWLEEMLNDFWARLLERMFRLVNSQYHFTDEYLECVSKYTE 
QLKPFGDVPRKLKLQVTRAFVAARTFAQGIAVAG 

VTVKPCYNYCSNIMRGCLANQGDLDFEWNNFIDAMmVAERLEGPFNIESVMDPIDVKISDAIMN 
MQDNSVQVSQKVFQGCGPPKPLPAGRISRSISESAFSARFRPHHPEERPTTAAGTSLDRLVTDVK 
EKLKQAKKFWSSLPSNVCNDERMAAGNGNEDDCWNGKGKSRYLFAVTGNGLANQGNNPEVQVDTS 
KPDILILRQIMALRVMTSKMKNAYNGNDVDFFDISDESSGEGSGSGCEYQQCPSEFDYNATDHAG 
KSANEKADSAGVRPGAQAYLLTVFC I LFLVMQREWR 

Important features: 
Signal peptide: 

amino acids 1-22 

ATP/GTP-binding site motif A (P-loop) ♦ 

amino acids 515-524 

N-glycosylation site. 

amino acids 514-518 

Glycosaminoglycan attachment sites. 

amino acids 494-498, 498-502 

N-myristoylation sites. 

amino acids 63-69, 224-230, 276-282, 438-444, 497-503, 531-537 
Glypicans proteins. 

amino acids 54-75, 105-157, 238-280, 309-346, 423-460, 468-506 
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FIGURE 25 



CTCGCCCTCflAATGGGAACGCTGGCCTGGGACTAAAGCATAGACCACCAGGCTGAGTATCCTGAC 
CTGAGTCATC CCCAGGGATCAGGAGCCTCCAGCAGGGAACC T T CCAT TATATTC TTCAAGCAAC T 
TACAGCTGCACCGACAGTTGCGATGAAAGTTCTAATCTCTTCCCTCCTCCTGTTGCTGCCACTAA 
TGCTGATGTCCATGGTCTCTAGCAGCCTGAATCCAGGGGTCGCCAGAGGCCACAGGGACCGAGGC 
CAGGCTTCTAGGAGATGGCTCCAGGAAGGCGGCCAAGAATGTGAGTGCAAAGATTGGTTCCTGAG 
AGC CCC GAGAAGAAAAT T CAT GACAG T G T C T GGGC T GC CAAAGAAGC AG T G CCC C T GT GAT CAT T 
TCAi^GGGCAATGTGAAGAAAACAAGACACCAAAGGCACCACAGAAAGCCAAACAAGCATTCCAGA 
GCCTGCCAGCAATTTCTCAAACAATGTCAGCTAAGAAGCTTTGCTCTGCCTTTGTAGGAGCTCTG 
AGC GCCCACTC TTCCAAT TAAACATTCTCAGC CAAGAAGACAGT GAGCACACCTACCAGACACT C 
TTCTTCTCCCACCTCACTCTCCCACTGTACCCACCCCTAAATCATTCCAGTGCTCTCAAAAAGCA 
TGTTTTTCAAGATCATTTTGTTTGTTGCTCTCTCTAGTGTCTTCTTCTCTCGTCAGTCTTAGCCT 
GTGCCCTCCCCTTACCCAGGCTTAGGCTTAATTACCTGAAAGATTCCAGGAAACTGTAGCTTCCT 
AGCTAGTGTCAT TTAAC CTTAAATGCAATCAGGAAAGT AGCAAACAGAAGT CAATAAATAT TT T T 
AAATGTCAAAAAAAAAAAAAAAAAA 
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FIGURE 26 



MKVL I S S L LL LL PLMLM SMVS S S LNPGVARG HRDRG QAS RRWL QEGG QE CE CK DW FL RAPRRK FM 
TVS GLPKKQCPCDHFKGNVKKTRHQRHHRKPNKHSRACQQFLKQCQLRS FALPL 



in 

in 

III 
2 

M 



is? 



Important features: 
Signal peptide: 

amino acids 1-22 

N-myristoylation sites. 

amino acids 27-33, 46-52 
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FIGURE 27 



GGACGCCAGCGCCTGCAGAGGCTGAGCAGGGAAAAAGCCAGTGCCCCAGCGGAAGCACAGCTCAG 
AGCTGGTCTGCCATGGACATCCTGGTCCCACTCCTGCAGCTGCTGGTGCTGCTTCTTACCCTGCC 
CCTGCACCTCATGGCTCTGCTGGGCTGCTGGCAGCCCCTGTGCAAAAGCTACTTCCCCTACCTGA 
TGGCCGTGCTGACTCCCAAGAGCAACCGCAAGATGGAGAGCAAGAAACGGGAGCTCTTCAGCCAG 
ATAAAGGGGCTTACAGGAGCCTCCGGGAAAGTGGCCCTACTGGAGCTGGGCTGCGGAACCGGAGC 
CAACTTTCAGTTCTACCCACCGGGCTGCAGGGTCACCTGCCTAGACCCAAATCCCCACTTTGAGA 
AGTTCCTGACAAAGAGCATGGCTGAGAACAGGCACCTCCAATATGAGCGGTTTGTGGTGGCTCCT 
p. GGAGAGGACATGAGACAGCTGGCTGATGGCTCCATGGATGTGGTGGTCTGCACTCTGGTGCTGTG 
Q CTCTGTGCAGAGCCCAAGGAAGGTCCTGCAGGAGGTCCGGAGAGTACTGAGACCGGGAGGTGTGC 
IP TCTTTTTCTGGGAGCATGTGGCAGAACCATATGGAAGCTGGGCCTTCATGTGGCAGCAAGTTTTC 
Jj* GAGCCCACCTGGAAACACAT TGGGGAT GGCTGCT GC C T CACCAGAGAGACCTGGAAGGAT C T TGA 

GAACGCCCAGTTCTCCGAAATCCAAATGGAACGACAGCCCCCTCCCTTGAAGTGGCTACCTGTTG 
hj GGCCCCACATCATGGGAAAGGCTGTCAAACAATCTTTCCCAAGCTCCAAGGCACTCATTTGCTCC 
c TTCCCCAGCCTCCAATTAGAACAAGCCACCCACCAGCCTATCTATCTTCCACTGAGAGGGACCTA 
O GCAGAAT GAGAGAAGACAT TCAT GTACCACCTACTAG TCCC TC TCTCCCCAACC T C TGCCAGGGC 

AAT CTCTAAC T T CAAT CCCGCC T TCGACAGTGAAAAAGC T C TAC TTC TACGC TGACCCAGGGAGG 
AAACACTAGGACCCT GTT G TATCCT CAAC TGCAAGT T T C T GGAC TAGTCT CCCAACGT T T GC C T C 
CCAATGTTGTCCCTTTCCTTCGTTCCCATGGTAAAGCTCCTCTCGCTTTCCTCCTGAGGCTACAC 
CCATGCGTCTCTAGGAACTGGTCACAAAAGTCATGGTGCCTGCATCCCTGCCAAGCCCCCCTGAC 
CCTCTCTCCCCACTACCACCTTCTTCCTGAGCTGGGGGCACCAGGGAGAATCAGAGATGCTGGGG 
AT GCCAGAGCAAGACT CAAAGAGGCAGAGGT TT T GT TCTCAAATAT T T T T TAATAAATAGAC GAA 
ACCACG 



n 



App_ID=10063553 



Page 179 of 320 



FIGURE 28 



MDILVPLLQLLVLLLTLPLHLMALLGCWQPLCKSYFPYLMAVLTPKSNRKMESKKRELFSQIKGL 
TGASGKVALLELGCGTGANFQFYPPGCRVTCLDPNPHFEKFLTKSMAENRHLQYERFWAPGEDM 
RQ!U\DGSMDVWCTLVLCSVQSPRKVLQEWR 

KHIGDGCCLTRETWKDLEJmQFSEIQMERQPPPLKWLPVGPHIMGKAVKQSFPSSKALICSFPSL 
QLEQATHQPIYLPLRGT 

Important features: 
Signal peptide: 

amino acids 1-23 

Leucine zipper pattern. 

amino acids 10-32 

N-myristoylation sites . 

amino acids 64-70, 78-84, 80-86, 91-97, 201-207 
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FIGURE 2Q 

CAATGTTTGCCTATCCACCTCCCCCAAGCCCCTTTACCTATGCTGCTGCTAACGCTGCTGCTGCT 

GCTGCTGCTGCTTAAAGGCTCATGCTTGGAGTGGGGACTGGTCGGTGCCCAGAAAGTCTCTTCTG 
CCACTGACGCCCCCATCAGGGATTGGGCCTTCTTTCCCCCTTCCTTTCTGTGTCTCCTGCCTCAT 
CGGCCTGCCATGACCTGCAGCCAAGCCCAGCCCCGTGGGGAAGGGGAGAAAGTGGGGGATGGCTA 
AGA^AGCTGGGAGATAGGGAACAGAAGAGGGTAGTGGGTGGGCTAGGGGGGCTGCCT TAT TT AAA 
GTGGTTGTT TAT GAT TC T TAT AC TAATT TATACAAAGATAT TAAGGCCCTGT T CATTAAGAAAT T 
GTTCCCTTCCCCTGTGTTCAATGTTTGTAAAGATTGTTCTGTGTAAATATGTCTTTATAA.TAAAC 
AGTTAAAAGCTGAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE so 



MLLLTLLLLLLLLKGSCLEWGLVGAQKVSSATDAPIRDWAFFPPSFLCLLPHRPAMTCSQAQPRG 
EGEKVGDG 

Important features: 
Signal peptide: 

amino acids 1-15 



Growth factor and cytokines receptors family: 

amino acids 3-18 



III 



yl 
W 
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FIGURE ftl 



m 
i.y 

w 

r i 

r« 



GTTTGAATTCCTTCAACTATACCCACAGTCCAAAAGCAGACTCACTGTGTCCCAGGCTACCAGTT 
CCTCCAAGCAAGTCATTTCCCTTATTTAACCGATGTGTCCCTCAAACACCTGAGTGCTACTCCCT 
ATTTGCATCTGTTTTGATAAATGATGTTGACACCCTCCACCGAATTCTAAGTGGAATCATGTCGG 
GAAGAGATACAATCC T TGGCC T GTGTATCC T CGCAT TAGC C T T GTCT T TGGCCATGATG TT TACC 
TTCAGATTCATCACCACCCTTCTGGTTCACATTTTCATTTCATTGGTTATTTTGGGATTGTTGTT 
TGT CTGCGGTGT TT TATGGTGGCTGTATTATGACTATACCAACGACCTCAGCATAGAATTGGACA 
CAGAAAGGGAAAATATGAAGTGCGTGCTGGGGTTTGCTATCGTATCCACAGGCATCACGGCAGTG 
CTGCTCGTCTTGATTTTTGTTCTCAGAAAGAGAATAAAATTGACAGTTGAGCTTTTCCAAATCAC 



jm AAATAAAGCCATCAGCAGTGCTCCCTTCCTGCTGTTCCAGCCACTGTGGACATTTGCCATCCTCA 



TTTTCTTCTGGGTCCTCTGGGTGGCTGTGCTGCTGAGCCTGGGAACTGCAGGAGCTGCCCAGGTT 
ATGGAAGGCGGCCAAGTGGAATATAAGCCCCTTTCGGGCATTCGGTACATGTGGTCGTACCATTT 
AATTGGCCTCATCTGGACTAGTGAATTCATCCTTGCGTGCCAGCAAATGACTATAGCTGGGGCAG 
TGGTTACTTGTTATTTCAACAGAAGTAAAAATGATCCTCCTGATCATCCCATCCTTTCGTCTCTC 
TCCATTCTCTTCTTCTACCATCAAGGAACCGTTGTGAAAGGGTCATTTTTAATCTCTGTGGTGAG 
GAT T CCGAGAAT CAT T GTCATGT ACAT GCAAAACGCACTGAAAGAACAGCAGCATGGTGCAT TGT 
CCAGGTACCTGTTCCGATGCTGCTACTGCTGTTTCTGGTGTCTTGACAAATACCTGCTCCATCTC 
AAC CAGAAT GCATATACTACAAC TGC T AT TAAT GGGACAGATTT C TGTACATCAGCAAAAGATGC 
ATT CAAAATCT T GT CCAAGAACT CAAGTCACT T TACATCTAT TAACT GCT T TGGAGAC T TCATAA 



f|| TTTTTCTAGGAAAGGTGTTAGTGGTGTGTTTCACTGTTTTTGGAGGACTCATGGCTTTTAACTAC 

AATCGGGCATTCCAGGTGTGGGCAGTCCCTCTGTTATTGGTAGCTTTTTTTGCCTACTTAGTAGC 
CCATAGTTTTTTATCTGTGTTTGAAACTGTGCTGGATGCACTTTTCCTGTGTTTTGCTGTTGATC 
TGGAAACAAATGATGGATCGTCAGAAAAGCCCTACTTTATGGATCAAGAATTTCTGAGTTTCGTA 
AAAAGGAGCAACAAAT TAAACAATGCAAGGGCACAGCAGGACAAGCACTCATTAAGGAAT GAGGA 
GGGAACAGAACT CCAGGCCAT TGTGAGATAGATACC CAT T TAGGTAT CTGT ACC TGGAAAACAT T 
T CCT TC TAAGAGCCAT T TACAGAATAGAAGATGAGACCACTAGAGAAAAGT TAGTGAAT T T T T T T 
T TAAAAGACC TAAT AAACCCTATTC T T CCTCAAAA 
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FIGURE 22 



,p..vi 



Saw-' 



MSGRDTILGLCIIiAIALSLAMMFTF^FITTLLVHIFISLVILGLLFVCGVLWWLYYDYTNDLSIE 
LDTERENMKCVLGFAIVSTGI TAVLLVLI FVLRKRIKLTVEL FQI TNKAI SSAPFLLFQPLWTFA. 
I L I FFWVLWVAVLL S LG TAGAAQVME GGQVE YKPLS G I RYMWS YHL I GL I WTS E F I LACQQMT I A 
GAWTCYFNRSKNDPPDHP ILS SLS I LFFYHQGT WKGS FLI S WRI PRI I VMYMQNALKEQQHG 
ALSRYLFRCCYCCFWCLDKYLLHLNQNAYTTTAINGTDFCTSAKDAFKILSKNSSHFTSINCFGD 
FI I FLGKVLWC FTVFGGLMAFNYNRAFQWAVPLLLVAFFAYLVAHS FLSVFE TVLDALFLCFA 
VDLETNDGSSEKPYFMDQEFLSFVKRSNKLNNARAQQDKHSLRNEEGTELQAIVR 

Important features: 
Signal peptide: 

amino acids 1-20 



if! Putative transmembrane domains: 

W amino acids 35-54, 75-97, 126-146, 1S5-204, 333-350, 352-371 

a? 

sis N-glycosylation sites. 



amino acids 204-208, 295-299, 313-317 
N-myristoYlation sites, 

amino acids 147-153, 178-184, 196-202, 296-275, 342-348 
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FIGURE 3 3 



u 



GTTCGATTAGCTCCTCTGAGAAGAAGAGAAAAGGTTCTTGGACCTCTCCCTGTTTCTTCCTTAGA 
ATAATTTGTATGGGATTTGTGATGCAGGAAAGCCTAAGGGAAAAAGAATATTCATTCTGTGTGGT 
GAAAATTTTTTGAAAAAAAAATTGCCTTCTTCAAACAAGGGTGTCATTCTGATATTTATGAGGAC 
TGTTGTTCTCACTATGAAGGCATCTGTTATTGAAATGTTCCTTGTTTTGCTGGTGACTGGAGTAC 
ATTCAAACAAAGAAACGGCAAAGAAGATTAAAAGGCCCAAGTTCACTGTGCCTCAGATCAACTGC 
GAT GT CAAAGCCGGAAAGATCATCGAT CCTGAGTTCATT GTGAAATGTCCAGCAGGAT GCCAAGA 
CCCCAAATACCATGTTTATGGCACTGACGTGTATGCATCCTACTCCAGTGTGTGTGGCGCTGCCG 
T AC ACAGTGGT GTGCT TGATAAT TCAGGAGGGAAAATACTTGT T C GGAAGGT TGCTGGACAGTCT 
GGT TACAAAGGGAGTTAT TCCAACGGT GTCCAATCGT TATCCC TACCACGATGGAGAGAATCCTT 
TAT CGTCT TAGAAAGT AAACCCAAAAAGGG TGTAAC CTACCCATC AGCT CT TACATAC TCAT CAT 
CGAAAAGTCCAGCT GCCCAAGCAGGTGAGACCACAAAAGCC TATCAGAGGCCACC TAT T CCAGGG 
ACAACTGCACAGCCGGTCACTCTGATGCAGCTTCTGGCTGTCACTGTAGCTGTGGCCACCCCCAC 
CACC TTGC CAAGGC CATCCCCT T CT GC TGC T TCTAC CACCAGCATCC CCAGACCACAATCAGTGG 
GCCACAGGAGCCAGGAGATGGAT CTCT GGTCCACTGCCACCTACACAAGCAGCCAAAACAGGCCC 
' ! AGAGCTGATCCAGGTATCCAAAGGCAAGAT CC T TCAGGAGC TGCC TT CCAGAAACC TG T TGGAGC 

V GGA.TGTCAGCCT GGGACT TGTTCCAAAAGAAGAATT GAGCACACAGT CTT TGGAGCCAG TATCCC 

f! T GGGAGAT CCAAACTGCAAAATT GAC T TGTCGT T TT TAAT T GATGGGAGCACCAGCAT T GGCAAA 

I! CGGCGATTCCGAATCCAGAAGCAGCTCCTGGCTGATGTTGCCCAAGCTCTTGACATTGGCCCTGC 

CGGTCCACTGATGGGTGTTGTCCAGTATGGAGACAACCCTGCTACTCACTTTAACCTCAAGACAC 
ACACGAATTCTCGAGATCTGAAGACAGCCATAGAGAAAAT TACTCAGAGAGGAGGACT TTCTAAT 
GTAGGTCGGGCCATCTCCTTTGTGACCAAGAACTTCTTTTCCAAAGCCAATGGAAACAGAAGCGG 
GGCTCCCAATGTGGTGGTGGTGATGGTGGATGGCTGGCCCACGGACAAAGTGGAGGAGGCTTCAA 
GAC TTGCGAGAGAGTCAGGAATCAACAT TT TCT TCATCACCAT TGAAGGTGC TGC TGAAftAT GAG 
f«j AAGCAGTATGTGGTGGAGCCCAACTTTGCAAACAAGGCCGTGTGCAGAACAAACGGCTTCTACTC 

GCTCCACGTGCAGAGCTGGTTTGGCCTCCACAAGACCCTGCAGCCTCTGGTGAAGCGGGTCTGCG 
\:l ACA.CTGACCGCCTGGCCTGCAGCAAGACCTGCTTGAACTCGGCTGACATTGGCTTCGTCATCGAC 

GGCTCCAGCAGTGTGGGGACGGGCAACTTCCGCACCGTCCTCCAGTTTGTGACCAACCTCACCAA 
AGAGT T TGAGAT T TCCGACACGGACAC GCGCATC GGGGCCG TGCAGTACACCTACGAACAGC GGC 
T GG AGT T T GGG T TC GACAAGTACAGCAGCAAGCCTGACAT CCTCAACGCCAT CAAGAGGGT GGGC 
TACTGGAGTGGTGGCACCAGCACGGGGGCTGCCATCAACTTCGCCCTGGAGCAGCTCTTCAAGAA 
GTCCAAGC CCAACAAGAGGAAGTTAAT GAT CCT CAT CACCGACGGGAGGTCCTACGACGACGTCC 
GGATCCCAGCCATGGCTGCCCATCTGAAGGGAGTGATCACCTATGCGATAGGCGTTGCCTGGGCT 
GCCCAAGAGGAGCTAGAAGTCATTGCCACTCACCCCGCCAGAGACCACTCCTTCTTTGTGGACGA 
GT T TGACAACCT CCAT CAGTATGTCCC CAGGAT CAT CCAGAACAT T TGTACAGAGT TCAACT CAC 
AGCCTCGGAACTGAAT TCAGAGCAGGCAGAGCACCAGCAAG TGCT GC T TTAC TAAC TGACG T G T T 
GGACCACCCCACCGCTTAATGGGGCACGCACGGTGCATCAAGTCTTGGGCAGGGCATGGAGAAAC 
AA^.TGTCTTGTTATTATTCTTTGCCATCATGCTTTTTCATATTCCAAAACTTGGAGTTACAAAGA 
TGATCACAAACGTATAGAATGAGCCAAAAGGCTACATCATGTTGAGGGTGCTGGAGAT T TTACAT 
TTTGACAATTGTTTTCAAAATAAATGTTCGGAATACAGTGCAGCCCTTACGACAGGCTTACGTAG 
AGCTTTTGTGAGATTTTTAAGTTGTTATTTCTGATTTGAACTCTGTAACCCTCAGCAAGTTTCAT 
T T T TGTCAT GACAATGTAGGAATTGCT GAAT TAAAT G T T TAGAAGGAT GAAAAATAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA?^^ 
AAA&AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAG 
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FIGURE 

MRT WLTMKASV IEMFLVLLVTGVHSNKETAKKI KRPKFTVPQ INCDVKAGKI I DPE FI VKC PAG 
CQDPKYHVYGTDVYASYSSVCGAAVHSGVLDNSGGKILVRKVAGQSGYKGSYSNGVQSLSLPRWR 
E S FI VLESKPKKGVTYPS ALT YSS SKSPAAQAGETTKAYQRPP I PGT TAQPVTLMQLLAVTVAVA 
TPTTLPRPSPSAASTTSIPRPQSVGHRSQEMDLWSTATYTSSQNRPRADPGIQRQDPSGAAFQKP 
VG^DVSLGLVPKEELSTQSLEPVSLGDPNCKIDLSFLIDGSTSIGKRRFRIQKQLLADVAQALDI 
GPAGPLMGWQYGDNPATHFNLKTHTNSRDLKTAIEKITQRGGLSNVGRAISFVTKNFFSKANGN 
RSGAPNWVVMVDGWPTDKVEEASRLARESGIN^ 

FYSLHVQSWFGLHKTLQPLVKRVCDTDRLACSKTCLNSADIGFVIDGSSSVGTGNFRTVLQFVTN 
LTKEFEISDTDTRIGAVQYTYEQRLEFGFDKYSSKPDILNAIKRVGYWSGGTSTGAAINFALEQL 
FKKSKPNKRKLMI LI T DGRS YDDVRI PAMAAHLKGVI T YAI GVAWAAQEELEVI ATHPARDHS FF 
VDEFDNLHQYVPRI IQNICTEFNSQPRN 

Important features: 
Sigrnal peptide: 

amino acids 1-26 

Transmembrane domain: 

amino acids 181-200 

N-glycosylation sites. 

amino acids 390-394, 520-524 

N-myristoylation sites. 

amino acids 23-29, 93-99, 115-121, 262-268, 367-373, 389-395, 
431-437, 466-472, 509-515, 570-576, 571-577, 575-581, 627-633 

Ami. da t ion site. 

amino acids 304-308 
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FIGURE 35 



in 



CCGAGCACAGGAGATTGCCTGCGTTTAGGAGGTGGCTGCGTTGTGGGAAAAGCTATCAAGGAAGAAATTGC 
CAA^CCATGTCTTTTTTTCTGTTTTCAGAGTAGTTCACAACAGATCTGAGTGTTTTAATTAAGCATGGAAT 
ACAGAAAACAAC AAAAAACTT AAGCT TT AATTTC AT CT GGAATTCC AC AGTT TTCTT AGCTCCCTGGACCC 
GGTTGACCTGTTGGCTCTTCCCGCTGGCTGCTCTATCACGTGGTGCTCTCCGACTACTCACCCCGAGTGTA 
AAGAACCTTCGGCTCGCGTGCTTCTGAGCTGCTGTGGATGGCCTCGGCTCTCTGGACTGTCCTTCCGAGTA 
GGAT GT C ACTGAGATCCCT C AAAT GGAGCC TC CTGCTGCT GTCACTCCT GAGTTTCTTTGTGATGTGGTAC 
CTCAGCCTTCCCCACTACAATGTGATAGAACGCGTGAACTGGATGTACTTCTATGAGTATGAGCCGATTTA 
L ml CAGACAAGACTTTCACTTCACACTTCGAGAGCATTCAAACTGCTCTCATCAAAATCCATTTCTGGTCATTC 

TGGTGACCTCCCACCCTTCAGATGTGAAAGCCAGGCAGGCCATTAGAGTTACTTGGGGTGAAAAAAAGTCT 
H TGGT GGGGAT ATGAGGTT C TTACAT TTT TCTT ATT AGGCC AAGAGGCT GAAAAGGAAGACAAAATGTTGGC 

|f! ATTGTCCTTAGAGGAT GAAC ACCT T CTT T ATGGTGACATAAT CCGACAAGAT T T TTTAGACAC AT ATAAT A 

III ACCT GACCT T GAAAACC AT T ATGGCATT CAGGTGGGTAAC T G AGTTTT GCCC C AATGCCAAGT ACGTAATG 

AAGACAGACACTGATGTTTTCATCAATACTGGCAATTTAGTGAAGTATCTTTTAAACCTAAACCACTCAGA 
GAAGTT TTTC AC AGGT T AT CCTCT AATT GATAATTATT CCT AT AGAGGAT TTT ACC AAAAAACCC ATATT T 
CTTACCAGGAGTATCCTTTCAAGGTGTTCCCTCCATACTGCAGTGGGTTGGGTTATATAATGTCCAGAGAT 
T TGGTGCC AAGG AT CT AT GAAATGATGGGTCACGTAAAAC CC ATC AAGTT TGAAGATGT T TATGTCGGGAT 
CTGT TT GAATTT AT TAAAAGTG AACATT CAT ATTCCAGAAGAC AC AAATCTT TT CTTTCT AT AT AGAAT CC 
f*k ATT T GGATGTCTGTCAACTGAGACGTGTGATTGCAGCCCATGGCTTTT CTTCCAAGGAGATCATCACTTTT 

f|| TGGCAGGTCATGCTAAGGAACACCACATGCCATTATTAACTTCACATTCTACAAAAAGCCTAGAAGGACAG 

GATACCTTGTGGAAAGTGTTAAATAAAGTAGGTACTGTGGAAAATTCATGGGGAGGTCAGTGTGCTGGCTT 
ACAC TGAACTGAAACT CATG AAAAACCCAGAC TGGAGACT GG AGGGTT ACACTT GTGAT T TATT AGTCAGG 
CCCTTCAAAGATGATATGTGGAGGAATTAAATATAAAGGAATTGGAGGTTTTTGCTAAAGAAATTAATAGG 
AC C AAAC AATTT GGAC AT GTC ATT CTGT AGAC TAGAAT TT CTT AAAAGGGTGTTACTGAGT TAT AAGCT CA 
CT AGGCTGT AAAAACAAAAC AATGT AGAGTTT TAT T TAT T GAAC AAT GTAGT C ACT TGAAGGTTTTGT G T A 
TAT CTT ATGTGGATTACCAATTTAAAAATATATGTAGT TCTGTGTCAAAAAACTTCTTCACT GAAGTT ATA 
CTG. AACAAAATT TT AC CT GTTT TT GGT CATTT AT AAAGTACTTC AAGATGTT GC AGTATT TC AC AGTT AT T 
ATT ATT T AAAAT T ACT TC AACTTT GT GT TTTT AAAT GTTT TGACGATT TCAAT AC AAG AT AAAAAGGAT AG 
TGA&TCATTCTT TACATGC AAACAT TTT CC AGT T AC TT AACTGATC AGTT TATT AT TGAT AC ATC ACT CCA 
TTAATGTAAAGTCATAGGTCATTATTGCATATCAGTAATCTCTTGGACTTTGTTAAATATTTTACTGTGGT 
AATATAGAGAAG AAT TAAAGC AAGAAAATCTG AAAA 



III 
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FIGURE 36 



MAS ALWTVLP SRMS LRSLKWS LL LLS LLS FFVMWYLS LPHYNVT ERVNWMYFYEYEP I YRQDFHF 
T LREHSNC S HQNP FLVI LVT S HP S DVKARQAI RVT WGEKKS WWGYE VLT F FLLGQEAEKE DKML A 
LSLEDEHLLYGDIIRQDFLDTYNNLTLKTIMM'RWVTEFCPNAKYVMKTDTDVFINTGNLVKYLL 
NLNHSEKFFTGYPLIDNYSYRGFYQKTHISYQEYPFKVFPPYCSGLGYIMSRDLVPRIYEMMGHV 
KPIKFEDVWGICLNLLKWIHIPEDTNLFFLYRIHLDVGQLRRVIAAHGFSSKEIITFWQVMLR 
NTTCHY 

Important features: 

Type II transmembrane domain: 

amino acids 20-39 

N-glycosylation sites. 

amino acids 72-76, 154-158, 198-202, 212-216, 326-330 

Glycosaminoglycan attachment site. 

amino acids 239-243 

Ly-6 / u-PAR domain proteins. 

amino acids 23-37 

N-myristoylation site. 

amino acids 271-277 
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FIGURE 37 



CGCTCGGGCACCAGCCGCGGCAAGGATGGAGCTGGGTTGCTGGACGCAGTTGGGGCTCACTTTTCTTCAGCTCCTTCTCATC 
TCGT C CTTGCCAAGAGAGT ACAC AGTCAT TAATGAAGCCTGCC CT GGAGCA&AGTGGAAT ATCAT GTGTCGGGAGTGCTGTG 
AATA.T GATCAGATTGAGTGCGTC T GCCCC GGAAAGAGGGAAGTCGTGGGT T ATACCATCC C TTGC TGCAGGAATGAGGAGAA 
TGAGTGTGACT CCTGCC TGATCC ACCCAGGT TGTACCATCTTTGAAAAC TGC AAGAGCTGC CGAAATGGCT CATGGGGGGGT 
AC C T T GGATGAC TTC TATGTGAAGGGGTT C TAC TGTGCAGAGTGC CGAGC AGGC TGGTACGGAGGAGACTGCATGCGAT GT G 
GCCAGGTTCTGCGAGCCCC AAAGGGTCAGAT TT TGT TGGAAAGCTAT CCCC TAAAT GCTCACTGTGAATGGAC CATT CATGC 
TAAAJCCTGGGTTTGTCATCCAACTAAGATTTGTCATGTTGAGTCTGGAGTTTGACTACATGTGCCAGTATGACTATGTTGAG 
GTT C GTGATGGAGACAACC GC GAT GGCCAGATCATCAAGCGTGTCTGTGGC AACGAGCGGC CAGC T CCTATCCAGAGCAT AG 
GAT C C TCACTC C ACGTCCT C T TC CAC TCC GATGGCTCCAAGAATTTT GACGG T TTCCATGCCATTTATGAGGAGATCACAGC 
ATGCTCCTCAT CCCC T TGT TTC CAT GACGGCACGTGCGTCCTTGACAAGGCTGGATCTTACAAGTGTGCCTGCTTGGCAGGC 
TATACTGGGCAG<:GCTGTGAAAATCTCCTTGAAGAAAGAAACTGCTCAGACCCTGGGG<5CCCAGTCAATGGGTACCAGAA 
TAACAGGGGGCCCTGGGCT TATCAACGGACGCCATGCTAAAAT TGGCACCGTGGTGTCTT TCTTTTGTAACAACTCCTATGT 
"ft TCT T AGTGGCAATGAGAAAAGAAC TTGCC AGCAGAATGGAGAGTGGT CAGGGAAAC AGCCCATCTGCATAAAAGCCTGC CGA 

W GAAC CAAAGAT T TCAGACC TGGT GAGAAGGAGAGTTC TTCCGATGCAGGT TC AGTCAAGGGAGACACCATTACACCAGC TAT 

i w 

JH ACTCAGCGGCCTTCAGCAAGCAGAAACTGCAGAGTGCCCCTACCAAGAAGCCAGCCCTTCCCTTTGGAGATCTGCCCATGGG 

U t ATACCAACATCTGCATACCCAGC TCCAGTATGAGTGCATCTCACCCT TC TACCGCCGCCTGGGCAGCAGCAGGAGGACATGT 

W CTGAGGACTGGGAAGTGGAGTGGGCGGGGACCATCCTGCATCCCTATCTGCGGGAAAATTGAG^ 

* CCCAAGGGTTGCGCTGGCCGTGGCAGGCAGCC^TCTACAGGAGGACCAGCGGGGTGCATGACGGCAGCCTACACAAGGGAGC 

|*# GTGGTTCCTAGTCTGCAGCGGTGCCCTGGTGAATGAGCGCACTGTGGTGGTGGCTGCCCACTGTGTTACTGACCTGGGGAAG 

111 GTCA.CCAT GATC AAGAC AGCAGAC C TGAAAGTTGTT T TGGGGAAATT C T AC CGGGATGAT GAC CGGGATGAGAAGACCAT C C 

|J agagcctacagatttctg<:tatcattctgcatcccaactatgaccccatcctgcttgatgctgacatcgccatcctgaagct 
HI ccta.gacaaggcccgtatcagcacccgagtccagcccatctgcctcgctgccagtcgggatctcagcacttccttccaggag 
Q tcccactvtcactgtggctggctggaatgtcctggcagacgtgaggagccctggcttcaagaacgacac^ctgcgctctgggg 
f y tggtcagtgtggtggac tcgc tgctgtgtgaggagcagcatgaggaccatggcatcccagtgagtgtcactgataacatgtt 

ctgtgccagctgggaacccactgccccttctgatatctgcactgcagagacaggaggcatcgcggctgtgtccttcccggga 
cgagcatctcctgagccacgctggcatctgatgggactggtcagctggag<:tatgataaaacatgcagccacaggctctcca 
ctgccttcaccaag^tgctgccttttaaagactggattgaaag 

tgtttctgtatatccgtctgtacgtgtgtcattgcgtgaagcagtgtgggcctgaagtgtgatttggcctgtgaacttggct 
gtgccagggcttctgacttcagggacaaaactc^tgaagggtgagtagacctccattgctggtaggctgatgccgcgtcca 
ctactaggacagccaattggaagatgccagggcttgcaagaagtaagtttcttc^^ 

ctccactgacctggtggtcttccccaactttcagttatacgaatgccatcagcttgaccagggaagatctgggcttcatgag 
gccccttttgaggctctc^vgttctagagagctgcctgtgggacagcccagggcagcagagctgggatgtggtgcatgcctt 
tgtgtacatggccacagtacagtctggtccttttccttccccatctcttgtacacattttaataaaataagggttggcttct 
gaac tacaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa 
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaw 
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FIGURE 38 
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MELGCWTQLGLTFLQLLLISSLPREYTVINEACPGAEWNIMCRECCEYDQIECVCPGKREWGYT 
IPCCRNEENECDSCLIHPGCTIFENCKSCRNGSWGGTLDDFYVKGFYCAECRAGWYGGDCMRCGQ 
VLRAPKGQ ILLE S YPLNAHCEWT IHAKPGFVI QLRFVMLSLE FDYMCQYDYVE VRDGDNRDGQI I 
KRVCGNERPAPIQSIGSSLHVLFHSDGSKNFDGFHAIYEEITACSSSPCFHDGTCVLDKAGSYKC 
ACL.^GYTGQRCENLLEERNCSDPGGPWGYQKITGGPGLINGRHAKIGTVVSFFCNNSYVLSGNE 
KRTCQQNGEWSGKQPICIKACREPKISDLVRRRVLPMQVQSRETPLHQLYSAAFSKQKLQSAPTK 
KPALPFGDLPMGYQHLHTQLQYECISPFYRRLGSSRRTCLRTGKWSGRAPSCIPICGKIENIT7VP 
KTQGLRWPWQAAI YRRT SGVHDGS LHKGAWFLVCS GMjVNERTVVVAAHCVTDLGKVTMIKTADL 
KWLGKFYRDDDRDEKTIQSLQISAIILHPNYDPILLDADIAILKLLDKARISTRVQPICLAASR 
DLSTSFQESHITVAGWNVLADVRSPGFKNDTLRSGWSWDSLLCEEQHEDHGIPVSVTDNMFCA 
SWEPTAPSDICTAETGGIAAVS FPGRASPEPRWHLMGLVSWSYDKTCSHRLSTAFTKVLPFKDWI 
ERNMK 

Important features of the protein: 
Signal peptide: 

amino acids 1-23 

EGF-like domain cysteine pattern signature. 

amino acids 2 60-272 
N-glycosylation sites, 

amino acids 96-100, 279-283, 316-320, 451-455, 614-618 
N-myristoylation sites. 

amino acids 35-41, 97-103, 256-262, 284-290, 298-304, 308-314, 
474-480, 491-497, 638-644, 666-672 
Ami da t ion site, 
amino acids 56-60 

Serine proteases, trypsin family. 

amino acids 489-506 

CUB domain proteins profile. 

amino acids 150-167 
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FIGURE 3Q 



GGTTCCTACATCCTCTCATCTGAGAATCAGAGAGCATAATCTTCTTACGGGCCCGTGATTTATTAACGTGGCTTAATC 
TGA?lGGTTCTCAGTCAAATTCTTTGTGATCTACTGATTGTGGGGGCATGGCAAGGTTTGCTTAAAGGAGCTTGGCTGG 

tttgggcccttgtagctgacagaaggtggccagggagaatgcagcacactgctcggagaatgaaggcgcttctgttgc 
tggtcttgccttggctcagtcctgctaactacattgacaatgtgggcaacctgcacttcctgtattcagaactctgta 
aaggt gcct ccc actacggcct gaccaaagataggaagaggc gctcacaagatggctgtccagac ggc tgt gcgagcc 
tcac^gccacggctccctccccagaggtttctgcagctgccaccatctccttaatgacagacgagcctggcctagaca 

ACCCTGCCTACGTGTCCTCGGCAGAGGACGGGCAGCCAGCAATCAGCCCAGTGGACrCTGGCCGGAGCAACCGAACTA 

gggc^cggccctttgagagatccactattagaagcagatcatttaaaaaaataaatcgagctttgagtgttcttcgaa 
ggacaaagagcgggagtgcagtt gccaaccatgccgaccagggcagggaaaattctgaaaacaccact gcccctgaag 
T CTTT ccaaggttgtaccacctgattccagatggtgaaattaccagcatcaagatcaatcgagtagatcccagtgaaa 
U gcctctctattaggctggtgggaggtagcgaaaccccactggtccatatcattatccaacacatttatcgtgatgggg 
U * tgatcgccagagacggccggctactgccaggagacatcattctaaaggtcaacgggat ggacatcagcaatgtccctc 

W ACAACTACGCTGTGCGTCT CCT GCGGCAGCCCT GCCAGGTGCTGTGGCTGACTGTGATGCGTGAACAGAAGTTCCGCA 

yj gcaggaacaatggacaggccccggat gcctacagaccccgagatgacagctttcatgt gattctcaacaaaagtagcc 

J 5 ccgj^ggagcagcttggaataaaactggtgcgcaaggtggatgagcctggggttttcatctt caat gtgctggatggcg 

2 ^ S 

*M gtgtggcatatcgacatggtcagctt gaggagaatgaccgtgtgttagccatcaatggacatgat cttcgatatggca 

S GCCCAGAAAGTGCGGCTCAT CT GATT CAGGCCAGT GAAAGACGT GTTCAC CTCGT CGT GTC CCGCCAGGTT CGGCAGC 

W GGAGCCCTGACATCTTTCAGGAAGCCGGCT GGAACAGCAAT GGCAGCT GGTCCCCAGGGCCAGGGGAGAGGAGCAACA 

IJ 1 CTCCCAAGCCCCTCCATCCTACAATTACTTGTCATGAGAAGGTGGTAAATATCCAAAAAGACCCCGGTGAATCTCTCG 

Jj* GCATGACCGT CGCAGGGGGAGCATCACAT AGAGAAT GGGATTT GCCT ATCT AT GT CAT CAGT GT T GAGCCCGGAGGAG 

|M T CATAAGCAGAGATGGAAGAATAAAAACAGGTGACATTTT GTT GAAT GT GGAT GGGGT CGAACT GACAGAGGTCAGCC 

W GGAGT GAGGCAGT GGCATT ATTGAAAAGAACAT CAT CCTC GATAGTACT CAAAGCTTT GGAAGT CAAAGAGTATGAGC 

mi 

«W C CCAGGAAGACT G CAGCAGCC CAGCAGC C CT GGACT CCAACCACAACAT GGCCCCACC CAGTGACTGGT CCCCAT C CT 

GGGTCATGT GGCT GGAATT AC CACGGTGCTT GT ATAACTGT AAAGATATT GTATT ACGAAGAAACACAGCT GGAAGTC 
T GGGCT TCT GCAT TGTAGGAGGTTAT GAAGAAT ACAATGGAAACAAACCTTTTTTCAT CAAATC CATT GTTGAAGGAA 
CACCAGCAT ACAATGATGGAAGAATTAGAT GTGGT GATAT T CT T CTT GCT GTCAAT GGTAGAAGT ACAT CAGGAAT GA 
TACAT GCTT GCTTGGCAAGACT GCT GAAAGAACTTAAAGGAAGAATTACTCTAACTAT TGT TTCTTGGCCT GGCACTT 
TTTTATAGAATCAATGATGGGTGAGAGGAAAACAGAA^ 

TTGT CAGTTTTTATAT TTAAAGAAAGAAT ACATT GT AAAAATGT CAG GAAAAGTATGATCAT CTAAT GAAAGCCAGTT 
ACAC CTCAGAAAATAT GATT CC AAAAAAAT TAAAACTACT AGT TTTTT TT CAGT GTGGAGGAT T TCT CATTACTCT AC 
AACi\T TGT TTAT ATTT TTT CTATT CAATAAAAAGC CCTAAAACAACTAAAAT GAT TGAT TT GTATAC C CCACTGAATT 
CAAGCT GATTTAAATTTAAAATTT GGTATAT GCT GAAGTCT GCCAAGGGTACATTAT GGCCAT TTT TAATT TACAGCT 
AAAAT ATT T TTT AAAATGCATT GCTGAGAAACGT T GCTTT CAT CAAACAAGAATAAATATT TT T CAGAAGT T AAA 
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FIGURE AO 

MKALLLLVLPWLS PANY I DNVGNLH FL YSELCKGASHYGLTKDRKRRSQDGCPDGCASLTATAPS 
PEVSAAATISLMTDEPGLDNPAYVSSAEDGQPAISPVDSGRSNRTRARPFERSTIRSRSFKKINR 
ALSVLRRTKSGSAVANHADQGRENSENT TAPEVFPRLYHL I PDGE I TS I KINRVDP S E SLS I RLV 
GGSETPLVHII IQHI YRDGVIARDGRLLPGDI ILKVNGMDISNVPHNYAVRLLRQPCQVLWLTVM 
REQKFRSRNNGQAPDAYRPRDDSFHVILNKS SPEEQLGIKLVRKVDEPGVFI FNVLDGGVAYRHG 
QLEENDRVLAI NGHDLRYGS PE SAAHL I QAS ERRVHLWSRQVRQRS PD I FQEAGWNSNGS WS PG 
PGERSNTPKPLHPT I TCHEKWNIQKDPGESLGMTVAGGASHREWDLPIYVISVEPGGVISRDGR 
IKTGDILLWDGVELTEVSRSEAVALLKRTSSSIVLKALEWEYEPQEDCSSPAALDSNHNMAPP 
S DWS PSWVMWLE LPRCLYNCKD I VLRRNTAGSLG FC I VGGYEEYNGNKP FFI KS I VEGTPAYNDG 
RIRCGDILLAVNGRS TS GMI HACIARLLKELKGRI TLT I VSWPGT FL 

Important features: 
Signal peptide: 

amino acids 1-15 

N-glycosylation sites. 

amino acids 108-112, 157-161, 289-293, 384-388 

Tyrosine kinase phosphorylation sites. 

amino acids 433-441, 492-500 

N-myri s toyla tion si tes . 

amino acids 51-57, 141-147, 233-239, 344-350, 423-429, 447-453, 
467-473, 603-609 
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FIGURE di 



ACCAGGCATTGTATCT TCAGTTGTCATCAAGTT CGCAATCAGATTGGAAAAGCTCAACTT GAAGCTTT 
CT TGCCT GCAGTGAAGCAGAGAGATAGATATTATT CACGTAATAAAAAAC ATGG GCTTCAACCT GACT 
TTCCACCTTTCCTACAAATTCCGATTACTGTTGCTGTTGACTTTGTGCCTGACAGTGGTTGGGTGGGC 
CAC CAGT AACT ACTT CGTGGGTGC CAT T CAAGAGAT T CCT AAAGC AAAG GAGTTC AT GG C TAAT T TC C 
AT^GACCCTCATTTTGGGGAAGGGAAAAACTCTGACTAATGAAGCATCCACGAAGAAGGTAGAACTT 
GAC/bACT GT CC T T CT G T GT C T CCT TACC T CAGAGG CCAGAG CAAG CT CATT T TCAAACCAGAT C T CAC 
TTT GGAAGAGGTACAGGCAGAAAATCCCAAAGT GT CCAGAGGCCGGTATCGCCCTCAGGAATGTAAAG 
CTT TACAGAGGGTCGCCATCCTCGTTCC CCAC C GGAACAGAGAGAAACACCTGATGTAC CTGCT GGAA 
CATCTGCATCCCTTCCTGCAGAGGCAGCAGCTGGATTATGGCATCTACGTCATCCACCAGGCTGAAGG 
TAAAAAGT T TAAT CGAGCC AAAC T C TTGAAT GT GGGC TAT C TAGAAGCC CT CAAG GAAGAAAAT T GGG 
ACT GCT TTATATTCCACGAT GTGGACCTGGTACCCGAGAATGACTT TAACC TTTACAAGT GTGAGGAG 
CAT CCCAAGCATCTGGTGGT TGGCAGGAACAGCACTGGGTACAGGT TAC GTTACAGTGGATATT TTGG 
GGGTGTTACTGCCCTAAGCAGAGAGCAGT TT TT CAAGGT GAATGGAT TC TC TAACAACTACTGGGGAT 
GGGGAGGCGAAGACGATGACCTCAGACTCAGGGTTGAGCTCCAAAGAATGAAAATTTCCCGGCCCCTG 
CCTGAAGTGGGTAAATATACAATGGTCTTCCACACTAGAGACAAAGGCAATGAGGTGAACGCAGAACG 
GAT GAAGC T CT TACAC CAAGTGT CACGAG TC T GGAGAACAGATG G GT T GAGTAGTT GTT C T TAT AAAT 
TAGTATC TGTGGAACACAATCCT TTATATATCAACAT CACAGTGGAT TT CT GGTTTGGTGCATGACCC 
f|| TGGATCTTTTGGTGATGTTTGGAAGAACTGATTCTTTGTTTGCAATAATTTTGGCCTAGAGACTTCAA 
Q ATAGTAGCACACATTAAGAACCT GTTACAGCTCATTGT T GAGCT GAATT TT TCCTTT TTGTATT TTCT 

ill TAGCAGAGCTCCTGGT GATGTAGAGTATAAAACAGT T GTAACAAGACAGCT T TCTTAGTCATTT T GAT 

CAT GAGGGTTAAATAT TGTAATATGGATACT TGAAGGACTTTATATAAAAGGATGACTCAAAGGATAA 
AAT GAAC GCTATTTGAGGAC TCT GGTTGAAGGAGATT TATTTAAAT TTGAAGTAATATATTATGGGAT 
AAAAGGCCACAGGAAATAAGACTGCTGAATGTCTGAGAGAACCAGAGT TGTT CTCGT CCAAGGT AGAA 
AGGT AC GAAGATACAAT AC T GTT AT TCAT TTAT CC T GTACAATCAT CT G T GAAG T GG TG G T GT C AGG T 
GAGAAGGCGTCCACAAAAGAGGGGAGAAAAGGCGACGAATCAGGACACAGT GAACTT GGGAATGAAGA 
GGTAGCAGGAGGGTGGAGTGTCGGCTGCAAAGGCAGCAGTAGCTGAGCTGGTTGCAGGTGCTGATAGC 
CTT CAGGGGAGGACC T GCCCAGGTATGCCTTCCAGTGAT GCCCACCAGAGAATACATTCTCTAT TAGT 
TTT TAAAGAGT T T TT GT AAAAT GAT TTT G TAC AAGTAGGATAT GAAT TAGCAGTT TACAAGTT T ACAT 
ATT^CTAATAATAAATATGTCTATCAAATACCTCTGTAGTAAAATGTGAAAAAGCAAAA 
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FIGURE 42 



MGFIJLTFHLSYKFRLLLLLTLCLTWGWATSNYFVGAIQEIPKAKEFMANFHKTLILGKGKTLTN 
EASTKKVELDNCPSVS PYLRGQSKL I FKPDLTLEEVQAENPKVS RGRYRPQECKALQRVAI LVPH 
RNREKHLMYLLEHLHPFLQRQQLDYGIYVIHQAEGKKF^ 

DLVPENDFNLYKCEEHPKHLWGRNSTGYRLRYSGYFGGVTALSREQFFKVNGFSNNYWGWGGED 
DDLRLRVELQRMKISRPLPEVGKYTMVFHTRDKGNEVNAERMKLLHQVSRWRTDGLSSCSYKLV 
SVEHNPLYINI TVDFWFGA 

Important features: 



B 

Signal peptide: 

ffi amino acids 1-27 

w 

^; N-glycosylation sites. 

U! 

I s § amino acids 4-8, 220-224, 335-339 



O Xylose isomerase proteins. 

IP 

ammo acids 191-202 

.SStS. 



W 5 
lit 
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FIGURE 43 
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GCK^AGACCCAGCAGTGGGACAGCCAGACAGACGGCACGATGGCACTGAGCTCCCAGATCTGGG 

CCGCTTGCCTCCTGCTCCTCCTCCTCCTCGCCAGCCTGACCAGTGGCTCTGTTTTCCCACAACAG 

ACGGGACAACTTGCAGAGCTGCAACCCCAGGACAGAGCTGGAGCCAGGGCCAGCTGGATGCCCAT 

GTTCCAGAGGCGAAGGAGGCGAGACACCCACTTCCCCATCTGCATTTTCTGCTGCGGCTGCTGTC 

ATCGATCAAAGTGTGGGATGTGCTGCAAGACGTAGAACCTACCTGCCCTGCCCCCGTCCCCTCCC 

TTCCTTATTTATTCCTGCTGCCCCAGAACATAGGTCTTGGAATAAAATGGCTGGTTCTTTTGTTT 

TCCAAAAZULAAAAA&AAAAAAAAAAAAAAAAAAA^ 

AAAi^AAAAAAAAAAAAAAAAAAAAAAAAAA 



AppJD=10063553 



Page 195 of 320 



FIGURE 44 

MALSSQIWAACLLLLLLLASLTSGSVETQQ 
CI FCCGCCHRSKCGMCCKT 

Important features: 
Signal peptide: 

amino acids 1-24 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 58-59 

N-myr i s toy 1 a t i on s i te . 

amino acids 44-50 

Profcaryotic membrane lipoprotein lipid attachment site, 

amino acids 1-12 
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FIGURE 45 

GTGGCT TCAT T T CAG T GGC TGACT T CCAGAGAGCAATATGGCTGGT TCCCCAACATGCCT GACCC 
TCATCTATATCCTTTGGCAGCTCACAGGGTCAGCAGCCTCTGGACCCGTGAAAGAGCTGGTCGGT 
TCCGTTGGTGGGGCCGTGACTTTCCCCCTGAAGTCCAAAGTAAAGCAAGTTGACTCTATTGTCTG 
GACCTT CAACACAACCCCTCT TGT CACCATACAGCCAGAAGGGGGCACT AT CATAG TGACCCAAA 
ATCG TAATAGGGAGAGAGTAGACT T CCCAGATGGAGGC TACTCCC TGAAGCTCAGCAAACTGAAG 
AAGAATGACT CAGGGATCTAC TATGTGGGGATATACAGCT CAT CACT CCAGCAGCCCTCCACCCA 
GGAGTACGTGCTGCATGTCTACGAGCACCTGTCAAAGCCTAAAGTCACCATGGGTCTGCAGAGCA 
ATAAGAATGGCACCTGTGTGACCAATCT GACAT GCT GCAT GGAACAT GGGGAAGAGGATGTGAT T 
TATACCTGGAAGGCCCTGGGGCAAGCAGCCAATGAGTCCCATAATGGGTCCATCCTCCCCATCTC 
CTGGAGATGGGGAGAAAGTGATATGACCT TCATC TGCGT T GCCAGGAAC CC TGTCAGCAGAAACT 
TCT(IAAGCCCCATCCTTGCCAGGAAGCTCTGTGAAGGTGCTGCTGATGACCCAGATTCCTCCATG 
GTCCTCCTGTGTCTCCTGTTGGTGCCCCTCCTGCTCAGTCTCTTTGTACTGGGGCTATTTCTTTG 
GT T TCTGAAGAGAGAGAGACAAGAAGAGTACAT TGAAGAGAAGAAGAGAGT GGACAT T TGTCGGG 
AAACTC CTAACATATGCCCCCAT T CTGGAGAGAACACAGAGTACGACACAATCC CTCACACT AAT 
AGAACAATCCTAAAGGAAGATCCAGCAAATACGGT T TACT CCACTGT GGAAATACCGAAAAAGAT 
GGAiyVATCCCCACTCACTGCTCACGATGCCAGACACACCAAGGCTAT TT GCCTATGAGAATGT TA 
TC TAGA CAGCAGTGCACTCCCCTAAGTCTCTGCTCA 
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FIGURE 46 

MAGSPTCLTL I YI LWQLTGSAASGPVKELVGSVGGAVTFPLKSKVKQVDS IVWTFNTTPLVT I QP 
EGGTIIVTQNRNRERVDFPDGGYSLKLSKLKKNDSGIYYVGIYSSSLQQPSTQEYVLHVYEHLSK 
PKVTMGLQSNKNGTCVTNLT^ 

VARl^PVSRNFSSPIlJUlKLCEGiyy^DPDSSMVLLCLLLVPLLLSLFVLGLFLWFLKRERQEEYIE 
EKKRVDI CRE TPNI CPHSGENTE YDT I PHTNRT I LKEDPANT VYS TVE I PKKMENPHS LLTMPDT 
PRLFAYENVI 



Important features: 
W Signal peptide: 



m 
iy 

in 

ill 



amino acids 1-22 

Transmembrane domain: 

amino acids 224-250 



pr a Leucine zipper pattern. 

im amino acids 229-251 



N-glycosylation sites. 



Jj; amino acids 98-102, 142-146, 148-152, 172-176, 176-180, 204-208, 

291--295 
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FIGURE 47 



5 : 



GGCTCGAGCGTTTCTGAGCCAGGGGTGACCATGACCTGCTGCGAAGGATGGACATCCTGCAATGG 
AT T CAGCC TGCTGGTTC TACTGCTGTTAGGAGTAGT TCTCAATGCGATACCTCTAAT TGTCAGCT 
TAGTTGAGGAAGACCAATTTTCTCAAAACCCCATCTCTTGCTTTGAGTGGTGGTTCCCAGGAATT 
ATAGGAGCAGGTCTGATGGCCATTCCAGCAACAACAATGTCCTTGACAGCAAGAAAAAGAGCGTG 
CTGCAACAACAGAACTGGAATGTTTCTTTCATCATTTTTCAGTGTGATCACAGTCATTGGTGCTC 
TGTATTGCATGCTGATATCCATCCAGGCTCTCTTAAAAGGTCCTCTCATGTGTAATTCTCCAAGC 
AACAGTAATGCCAAT T G TGAAT TT T CATTGAAAAACAT CAG T GACAT TCAT CCAGAATCCT TCAA 
CTTGCAGTGGTTTTTCAATGACTCTTGTGCACCTCCTACTGGTTTCAATAAACCCACCAGTAACG 
ACACCATGGC GAG T GGC TGGAGAGCATCTAGT T TCCACT T CGATTCT GAAGAAAACAAAC ATAGG 
CTTATCCACTTCTCAGTATTTTTAGGTCTATTGCTTGTTGGAATTCTGGAGGTCCTGTTTGGGCT 
CAGTCAGATAGTCATCGGTTTCCTTGGCTGTCTGTGTGGAGTCTCTAAGCGAAGAAGTCAAATTG 



I!! T GTAG T T TAATGGGAATAAAATGTAAGTATCAGTAGTT TGAAAAAAAAAAA 

If! 



5 S 
ill 



SI » 
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FIGURE AS 

MTCCEGWTSCNGFSLLVLLLLGWLNAIPLIVSLVEEDQFSQNPISCFEWWFPGIIGAGLMAIPA 
T TMSLTARKRACCNNRTGMFL SS FFS VI T VIGALYCML I S I QALLKGPLMCNS P SNSNANCE FSL 
KNISDIHPESFNLQWFFNDSCAPPTGFNKPTSNDTMASGWRASSFHFDSEENKHRLIHFSVFLGL 
LLVGILEVLFGLSQIVIGFLGCLCGVSKRRSQIV 

Important features: 
Transmembrane domains: 

amino acids 10-31 (type II), 50-72, 87-110, 191-213 
N-glycosylation sites. 

amino acids 80-84, 132-136, 148-152, 163-167 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 223-227 

N-myristoylation sites. 

amino acids 22-28, 54-60, 83-89, 97-103, 216-222 

Prokaryotic membrane lipoprotein lipid attachment site. 

amino acids 207-218 

TNFR/NGFR family cysteine-rich region protein. 

amino acids 4-12 
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FIGURE J.Q 



ATCCGTTCTCTGCGCTGCCAGCTCAGGTGAGCCCTCGCCAAGGTGACCTCGCAGGACACTGGTGA 
AGGAGCAGTGAGGAAC C TGCAGAGTCACACAGT TGCTGACCAAT TGAGCT GTGAGCCT GGAGCAG 
ATCCGTGGGCTGCAGACCCCCGCCCCAGTGCCTCTCCCCCTGCAGCCCTGCCCCTCGAACTGTGA 
CATiSGAGAGAGTGACCCTGGCCCTTCTCCTACTGGCAGGCCTGACTGCCTTGGAAGCCAATGACC 
CATTTGCCAATAAAGACGATCCCTTCTACTATGACTGGAAAAACCTGCAGCTGAGCGGACTGATC 
TGCGGAGGGCTCCTGGCCATTGCTGGGATCGCGGCAGTTCTGAGTGGCAAATGCAAATACAAGAG 
CAGCCAGAAGCAGCACAGTCC TGTACCT GAGAAGGCCATC CCACTCATCACTCCAGGC TCT GCCA 
CTACTTGCTGAGCACAGGACTGGCCTCCAGGGATGGCCTGAAGCCTAACACTGGCCCCCAGCACC 



f«t TCCTCCCCTGGGAGGCCTTATCCTCAAGGAAGGACTTCTCTCCAAGGGCAGGCTGTTAGGCCCCT 

gl TTCTGATCAGGAGGCTTCTTTATGAATTAAACTCGCCCCACCACCCCCTCA 

III 
|R 

!H 

M s 

hi 

* 

II! 
O 

ru 
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FIGURE 50 



S3 

f s l amino acids 36-59 



MERVTLALLLLAGLTALEANDPFANKDDPFYYDWKNLQLSGLICGGLLAIAGIAAVLSGKCKYKS 
S QKQHS PVPEKAI PL I T PGSAT T C 

Important features: 
Signal peptide: 

amino acids 1-16 

Transmembrane domain: 



III N-myristoylation sites, 

amino acids 41-41, 45-51, 84-90 



Extracellular proteins SCP/Tpx-l/Ag5/PR-l/Sc7 

? !1 amino acids 54-67 
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FIGURE si 



GTGGACTC TGAGAAGCCCAGGCAGTTGAGGACAGGAGAGAGAAGGCT GCAGACCCAGAGGGAGGG 
AGGACAGGGAGTCGGAAGGAGGAGGACAGAGGAGGGCACAGAGACGCAGAGCAAGGGCGGCAAGG 
AGGAGACCC TGGTGGGAGGAAGACAC T CT GGAGAGAGAGGGGGC TGGGCAGAGATGAAGT TCCAG 
GGGCCCCTGGCCTGCCTCCTGCTGGCCCTCTGCCTGGGCAGTGGGGAGGCTGGCCCCCTGCAGAG 
C GGAGAGGAAAGCAC T GGGACAAAT AT TGGGGAGGC CC TT GGACATGGCC T GGGAGACGCCC TGA 
GCGAAGGGGTGGGAAAGGCCATT GGCAAAGAGGCCGGAGGGGCAGCT GGCT CTAAAGTCAGTGAG 
GCC CT TGGCCAAGGGACCAGAGAAGCAGT TGGCACT GGAGTCAGGCAGGT T CCAGGC T T T GGCGC 
AGCAGATGCTTTGGGCAACAGGGTCGGGGAAGCAGCCCATGCTCTGGGAAACACTGGGCACGAGA 
TTGGCAGACAGGCAGAAGATGTCATTCGACACGGAGCAGATGCTGTCCGCGGCTCCTGGCAGGGG 
fiR. GTGCCTGGCCACAGTGGTGCTTGGGAAACTTCTGGAGGCCATGGCATCTTTGGCTCTCAAGGTGG 
W CCTTGGAGGCCAGGGCCAGGGCAATCCTGGAGGTCTGGGGACTCCGTGGGTCCACGGATACCCCG 

GAAACTCAGCAGGCAGCTTTGGAATGAATCCTCAGGGAGCTCCCTGGGGTCAAGGAGGCAATGGA 
GGGCCACCAAACTTTGGGACCAACACTCAGGGAGCTGTGGCCCAGCCTGGCTATGGTTCAGTGAG 
AGCCAGCAACCAGAATGAAGGGTGCACGAATCCCCCACCATCTGGCTCAGGTGGAGGCTCCAGCA 
O ACTCTGGGGGAGGCAGCGGCTCACAGTCGGGCAGCAGTGGCAGTGGCAGCAATGGTGACAACAAC 
Vf AATGGCAGCAGCAGTGGTGGCAGCAGCAGTGGCAGCAGCAGTGGCAGCAGCAGTGGCGGCAGCAG 

!!! TGGCGGCAGCAGTGGTGGCAGCAGTGGCAACAGTGGTGGCAGCAGAGGTGACAGCGGCAGTGAGT 

pi 

a. w 



uj 



CCTCCTGGGGATCCAGCACCGGCTCCTCCTCCGGCAACCACGGTGGGAGCGGCGGAGGAAATGGA 
CATAAACCCGGGTGTGAAAAGCCAGGGAATGAAGCCCGCGGGAGCGGGGAATCTGGGATTCAGGG 
CTTCAGAGGACAGGGAGTTTCCAGCAAGATGAGGGAAATAAGCAAAGAGGGCAATCGCCTCCTTG 
GAGGCTCTGGAGACAATTATCGGGGGCAAGGGTCGAGCTGGGGCAGTGGAGGAGGTGACGCTGTT 
GGTGGAGTCAATACTGTGAACTCTGAGAGGTCTCCTGGGATGTTTAACTTTGACACTTTCTGGAA 
GAAT T TTAAATC CAAGC TGGGT T TCATCAACTGGGATGCCATAAACAAGGACCAGAGAAGC T C T C 
GCATCCCG TGACCT CC AGACAAGGAGCCACCAGAT T GGAT GGGAGCC CCCACAC TCCC TC C T TAA 
AACACCAC CC TCTCATCAC TAAT CT CAGCCCT TGCC CT TGAAATAAACCT TAGC TGCCCCACAAA 
AAA^^AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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3 8 
HI 

!-3 



MKFQGPLACLLLALCLGSGEAGPLQSGEESTGTNIGEALGHGLGDALSEGVGKAIGKEAGGAAGSKVS 
EALGQGT REAVGT GVRQVP GFGAADALGNRVGE AAHALGNT GHE I GRQAEDVI RHGADAVRGSWQGVP 
GHSGAWETSGGHGIFGSQGGLGGQGQGNPGGLGTPWVHGYPGNSAGSFGMNPQGAPWGQGGMGGPPNF 
GTNTQGAVAQPGYGSVRASNQNEGCTNPPPSGSGGGSSNSGGGSGSQSGSSGSGSNGDNNNGSSSGGS 
SSGSSSGSSSGGSSGGSSGGSSGNSGGSRGDSGSESSWGSSTGSSSGNHGGSGGGNGHKPGCEKPGNE 
ARGSGESGIQGFRGQGVSSNMREISKEGNRLLGGSGDNYRGQGSSWGSGGGDAVGGWl^SETSPGM 
FNFDTFWKNFKSKLGFINWDAINKDQRSSRIP 

Signal peptide: 
amino acids 1-21 

N-glycosylation site. 

amino acids 265-269 



jW Glycosaminoglycan attachment site. 

fj; amino acids 235-239, 237-241 f 244-248 r 255-259 f 324-328, 388-392 



Casein kinase II phosphorylation site. 

amino acids 26-30, 109-113, 259-263, 300-304, 304-308 
N-myris toyl a tion si te . 

amino acids 17-23, 32-38, 42-48, 50-56, 60-66, 61-67, 64-70, 74-80, 

90-96, 96-102, 130-136, 140-146, 149-155, 152-158, 155-161, 

159-165, 163-169, 178-184, 190-196, 194-200, 199-205, 218-224, 

236-242, 238-244, 239-245, 240-246, 245-251, 246-252, 249-252, 

253-259, 256-262, 266-272, 270-276, 271-277, 275-281, 279-285, 

283-289, 284-290, 287-293, 288-294, 291-297, 292-298, 295-301, 

298-304, 305-311, 311-317, 315-321, 319-325, 322-328, 323-329, 

325-331, 343-349, 354-360, 356-362, 374-380, 381-387, 383-389, 
387-393, 389-395, 395-401 

Cell attachment sequence. 

amino acids 301-304 
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GGAGJAAGAGGTTGTGTGGGACAAGCTGCTCCCGACAGAAGGATGTCGCTGCTGAGCCTGCCCTGG 

CTGGGCCTCAGACCGGTGGCAATGTCCCCATGGCTACTCCTGCTGCTGGTTGTGGGCTCCTGGCT 

ACTCGCCCGCATCCTGGCTTGGACCTATGCCTTCTATAACAACTGCCGCCGGCTCCAGTGTTTCC 

CACMCCCCCAAAACGGAACTGGTTTTGGGGTCACCTGGGCCTGATCACTCCTACAGAGGAGGGC 

TTGi^AGGACTCGACCCAGATGTCGGCCACCTATTCCCAGGGCTTTACGGTATGGCTGGGTCCCAT 

CATCCCCTTCATCGTTTTATGCCACCCTGACACCATCCGGTCTATCACCAATGCCTCAGCTGCCA 

TTGCACCCAAGGATAATCTCTTCATCAGGTTCCTGAAGCCCTGGCTGGGAGAAGGGATACTGCTG 

AGTGGCGGTGACAAGTGGAGCCGCCACCGTCGGATGCTGACGCCCGCCTTCCATTTCAACATCCT 

GAAGTCCTATATAACGATCTTCAACAAGAGTGCAAACATCATGCTTGACAAGTGGCAGCACCTGG 

CCTCAGAGGGCAGCAGTCGTC TGGACAT GT TTGAGCACAT CAGCCTCAT GACCT T GGACAGT CT A 

CAGAAATGCATCTTCAGCTT TGACAGCCATTGTCTiGGAGAGGCCCAGTGAATATAT TGCCACCAT 

CTTGGAGCTCAGTGCCCTTGTAGAGAAAAGAAGCCAGCATATCCTCCAGCACATGGACTTTCTGT 

ATTACCTCTCCCATGACGGGCGGCGCTTCCACAGGGCCTGCCGCCTGGTGCATGACTTCACAGAC 

GCTGTCATCCGGGAGCGGCGTCGCACCCTCCCCACTCAGGGTATTGATGATTTTTTCAAAGACAA 

AGCCAAGTCCAAGACTTTGGATTTCATTGATGTGCTTCTGCTGAGCAAGGATGAAGATGGGAAGG 

CAT T G T C AGAT GAG G AT AT AAG AG C AG AG GC T G ACAC C T T CAT G T T T G GAG GCC AT GAC ACC AC G 

GCCAGTGGCCTCTCCTGGGTCCTGTACAACCTTGCGAGGCACCCAGAATACCAGGAGCGCTGCCG 

ACAGGAGGTGCAAGAGCTTCTGAAGGACCGCGATCCTAAAGAGATTGAATGGGACGACCTGGCCC 

AGCTGCCCTTCCTGACCATGTGCGTGAAGGAGAGCCTGAGGTTACATCCCCCAGCTCCCTTCATC 

TCCCGATGCTGCACCCAGGACATTGTTCTCCCAGATGGCCGAGTCATCCCCAAAGGCATTACCTG 

CCTCAT CGATATTATAGGGGTCCAT CACAACCCAAC T GTGT GGCCGGAT CCT GAGGTC TACGACC 
CCTTCCGCTTTGACCCAGAGAACAGCAAGGGGAGGTCACCTCTGGCTTTTATTCCTTTCTCCGCA 
GGGCCCAGGAACTGCATCGGGCAGGCGTTCGCCATGGCGGAGATGAAAGTGGTCCTGGCGTTGAT 
GCTGCTGCACTTCCGGTTCCTGCCAGACCACACTGAGCCCCGCAGGAAGCTGGAATTGATCATGC 
GCGCCGAGGGCGGGCTTTGGCTGCGGGTGGAGCCCCTGAATGTAGGCTTGCAGTGACTTTCTGAC 
CCATCCACCTGTTTTTTTGCAGATTGTCATGAATAAAACGGTGCTGTCAAA 



FIGURE *4 



MSLLSLPWLGLRPVJVMSPWLLLLLWGSWL 

LITPTEEGLKDSTQMSATYSQGFTWLGPIIPFIVLCHPDTIRSITNASAAIAPKDNLFIRFLKP 

WLGEG I LL SGGDKWSRHRRMLTPAFHFNILKS Y I T I FNKSAN IMLDKWQHLAS EGS S RLDMFEHI 
SLMTLD.SLQKCIFSFDSHCQERPSEYIATILELSALVEKRSQHILQHMDFLYYLSHDGRRFHRAC 
RLVtlDFTDAVIRERRRTLPTQGIDDFFKDKAKSKTLDFIDVLLLSKDEDGKALSDEDIRAEADTF 
MFGGHDT TASGLSWVLYNLARHPEYQERCRQEVQELLKDRDPKE IEWDDLAQLPFLTMCVKES LR 
LHPPAP F I SRCCTQDI VLPDGRVI PKGI TCLIDII GVHHNPTVWPDPEVYDPFRFDPENSKGRSP 
LAFI P FSAGPRNCI GQAFAMAEMKWLALMLLHFRFLPDHTE PRRKLEL IMRAEGGLWLRVEPLN 
VGLQ 

Important features: 
Transmembrane domains: 

amino acids 13-32 (type II), 77-102 

Cytochrome P450 cysteine heme-iron ligand signature. 

amino acids 4 61-471 

N-glycosylation sites . 

amino acids 112-116, 168-172 
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FIGURE 55 



ATCGCATCAATTGGGAGTACCATCTTCCTCATGGGACCAGTGAAACAGCTGAAGCGAATGTTTGA 
GCCTACTCGTTTGATTGCAACTATCATGGTGCTGTTGTGTTTTGCACTTACCCTGTGTTCTGCCT 
TTTGGTGGCATAACAAGGGACTTGCACTTATCTTCTGCATTTTGCAGTCTTTGGCATTGACGTGG 
TACA.GCCTTTCCTTCATACCATTTGCAAGGGATGCTGTGAAGAAGTGTTTTGCCGTGTGTCTTGC 
ATAAT TCATGGCCAGT T T TATGAAGCT T TGGAAGGCACTATGGACAGAAGC TGGTGGACAGT T T T 
GTMCTATCTTCGAAACCTCTGTCTTACAGACATGTGCCTTTTATCTTGCAGCAATGTGTTGCTT 
GTGATTCGAACATTTGAGGGTTACTTTTGGAAGCAACAATACATTCTCGAACCTGAATGTCAGTA 
GCACAGGATGAGAAGTGGGTTCTGTATCTTGTGGAGTGGAATCTTCCTCATGTACCTGTTTCCTC 
TCT GGATGT TGT CCCAC TGAATT CCCATGAATACAAACCTAT TCAGCAACAGCAAAAAAAAAAAA 
AAA2iAAAAAAAAAAAAAAAAAAAAAAAAAAAAA^^ 
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FIGURE 56 

MG PVKQLKRMFE PTRL I AT IMVLLCFALTLC S AFWWHNKGLAL I FC I LQS LALTW Y S LS F I P FAR 
DAVKKCFAVCLA 

Important features: 
Signal peptide: 

amino acids 1-33 



U Type II fibronectin collagen-binding domain protein, 

amino acids 30-72 
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CGGCTCGAGCTCGAGCCGAATCGGCTCGAGGGGCAGTGGAGCACCCAGCAGGCCGCCAAC^TGCTCTGTCTGTGCCTG 
T ACGT GC CGGTCATCGGGGAAGC CCAGAC C GAGT T CCAGTACTT T GAGTCGAAGGGGCTC CCTGCCGAGCTGAAGT CC 
ATTTTCAAGCTCAGTGTCTTCATCCCCTCCCAGGAATTCTCCACCTACCGCCAGTGGAAGCAGAAAATTGTACAAGCT 
G GAGATAAGGACCTT GATGGGCAGCT AGACTTTGAAGAATT TGT CCAT TATCTCCAAGAT CATGAGAAGAAGCTGAGG 
CT GGT GTTT AAGATT TTGGACAAAAAGAAT GATGGACGCATTGACGC GCAGGAGATCAT GCAGT CCCT GCGGGACTTG 
GGAGT CAAGATATCT GAAGAGCAGGCAGAAAAAATT CTCAAGAGCAT GGAT AAAAAC GGCACGAT GACC AT C GACT GG 
AACGi^GTGGAGAGACTACCACCTCCTCCACCCCGTGGAAAACATCCCCGAGATCATCCTCTACTGGAAGCATTCCACG 

AT CTT TGAT GTG G GT GAGAATCT AACGGT CCCGGAT GAGT TCACAGT GGAGGAGAGGCAGACGGGGAT GTGGTGGAGA 
CACCTGGTGGCAGGAGGTGGGGCAGGGGCCGTATCCAGAACCTGCACGGCCCCCCTGGACAGGCTCAAGGTGCTCATG 
CAGGT CCAT GCCTCC CGCAG CAACAACAT GGGCAT CGT TGGT GG CTT CACTCAGATGATT CGAGAAGGAGGGGCCAGG 
TCACTCTGGCGGGG(^AATGGCATCAACGTCCTCAAAATTGCCCCCGAATCAGCCATCAAATT CAT GGCCTAT GAGCAG 
AT CAkGCGCCTT GTT GGTAGTGACCAGGAGACT CTGAGGATT CACGAGAGGCTT GTGGCAGGG T C CTT GGCAGGGGCC 
U AT CGC CCAGAGCAGCATCT ACC CAAT GGAGGTC CT GAAGACCCGGAT GGCGCTGCGGAAGACAGGCCAGTACTCAGGA 

O AT GC T GGACT GCGCCAGGAGGAT CCTGGCCAGAGAGGGGGT GGCCGCCTT CTACAAAGGCT AT GT CC CCAACATGCT G 

SI GGCAT CAT C CCCTAT GCCGGCAT CGACCTT GCAGT CTACGAGAC GCT CAAGAAT GCCT GGCT GCAGCACTAT GCAGT G 

IP AACAGCGCGGACCCCGGCGTGTTTGTGCTCCTGGCCTGTGGCACCATGTCCAGTACCTGTGGCCAGCTGGCCAGCTAC 
1 5 1 CCCCTGGCCCTAGTCAGGACCCGGATGCAGGCGCAAGCCTCTATTGAGGGCGCTCCGGAGGTGACCATGAGCAGCCTC 
1% TTCAAACATATCCTGCGGACCGAGGGGGCCTTCGGGCTGTACAGGGGGCTGGCCCCCAACTTCATGAAGGTCATCCCA 
*?* GCT GT GAGCAT CAGCTAC GTGGT CTACGAGAACCTGAAGATCACCCT GGGC GT GCAGT CGCGGTGACG GGGGGAGGGC 

CGCCCGGCAGTGGACTCGCTGATCCTGGGCCGCAGCCTGGGGTGTGC^GCC^TCTCATTCTGTGAATGTGCC^ACACT 
AAGCTGTCTCGAGCCAAGCTGTGAAAACCCTAGACGCACCCGCAGGGAGGGTGGGGAGAGCTGGCAGGCCCAGGGCTT 
W GTCCTGCTGACCCCAGCAGACCCTCCTGTTGGTTCCAGCGAAGACCACAGGCATTCCTTAGGGTCCAGGGTCAGCAGG 

CT CC GGGCTCACATGT GTAAGGACAGGACATTTTCT G CAGT GCCTGC CAAT AGT GAGCTT GGAGCCTGGAGGCCGGCT 
Ssss, TAGTTCTTCCATTTCACCCTTGCAGCCAGCTGTTGGCCACGGCCCCTGCCCTCTGGTCTGCCGTGCATCTCCCTGTGC 
W CCTCTTGCTGCCTGCCTGTCTGCTGAGGTAAGGTGGGAGGAGGGCTACAGCCCACATCCCACCCCCTCGTCCAATCCC 
If] ATAATCCATGATGAAAGGTGAGGTCACGTGGCCTCCC^GGCCTG^^ 

^ TGTGAAGGAAGAGGAAAGGATCTGGCCTT GTGGT CACTGGCATCTGAGCCCTGCT GAT GGCT GGGGCTCTCGGGCATG 

1*1 CTTGGGAGTGCAGGGGGCTCGGGCTGCCTGGCCTGGCTGCACAGAAGGCAAGTGCTGGGGCTCATGGTGCTCTGAGCT 
111 GGCCTGGACCCTGTCAGGATGGGCCCCACCTCAGAACCAAACTCACTGTCCCC^^ 

*;S CCATGTTTGAGGGCGAAGGGCAGAGCGTTTGTGTGTTCTGGGGAGGGAAGGAAAAGGTGTTGGAGGCCTTAATTATGG 
W ACT GTT GGGAAAAGGGTTTTGT CCAGAAGGACAAGCC GGACAAATGAGCGACT TCTGT GCT TCCAGAGGAAGACGAGG 

f|| GAGCAGGAGCTTGGCTGACTGCTCAGAGTCTGTTCTGACGCCCTGGGGGTTCCTGTCCAACCCCAGCAGGGGCGCAGC 

G GGAC CAGCCCCACAT TCCACTT GTGTCACT GCTTGGAAC CTATT TATTTT GTATTT AT TT GAACAGAGT T AT GT C CT 
AACTATTTTTAT AGATTTGTTT AATT AATAGCTTGT CATTT TCAAGTT CATT TTT TATT CATATTTAT GTT CATGGT T 
GATT GTACCTT CCCAAGC CCGCCCAGTGGGATGGGAGGAGGAGGAGAAGGGGGGCCTT GGGCCGCTGCAGT CACAT CT 
GTCC^GAGAAATTCCTTTTGGGACTGGAGGCAGAAAAGCGGCC^GAAGGCAGCAGCCCTGGCTCCTTTCCTTTGGCAG 
GTT GGGGAAGGGCT T GCCCCCAGCCTTAGGATT T CAGGGTTT GACT GGGGGCGTGGAGAGAGAGGGAGGAACCTCAAT 
AACCTTGAAGGTGGAATCCAGTTATTTCCTGCGCTGCGAGGGTTTCTTTATTTCACTCTTTTCTGAATGTCAAGGCAG 
TGAGGTGCCTCTCACTGTGAATTTGTGGTGGGCGGGGGCTGGAGGAGAGGGTGGGGGGCTGGCTCCGTCCCTCCCAGC 
CTTCTGCTGCCCTTGCTTAACAATGCCGGCCAACTGGCGACCTCACGGTTGCACTTCCATTCCACCAGAATGACCTGA 
T GAGGAAAT CTTCAAT AGGAT GCAAAGAT CAAT GCAAAAATTGTTAT ATAT GAACAT ATAACT GGAGT C GT CAAAAAG 
CAAAT TAAGAAAGAATTGGACGTTAGAAGTT GT CATTTAAAGCAGCCTT CT AAT AAAGT TGTT T CAAAGCT GAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA^ 



a 5 
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MLCLCLYVPVIGEAQTEFQYFESKGLPAELKSIFKLSVFIPSQEFSTYRQWKQKIVQAGDKDLDG 
QLDraEFVHYLQDHEKKLRLVFKI LDKKNDGRIDAQEIMQSLRDLGVKI SEQQAEKILKSMDKNG 
TMT I DWNEWRD YHLLHP VEN IPEII LYV7KHS T I FDVGENLTVPDE FTVEERQTGMWWRHLVAGGG 
AGAVSRTCTAPLDRLKVLMQVHASRSNNMG I VGGFT QMI REGGARS LWRGNG INVLKIAPE S AI K 
FMAYEQ I KRLVGS DQETLRI HERLVAGS LAGAIAQS S I Y PMEVLKTRMALRKTGQYSGMLDCARR 
IIAIIEGVAAFYKGWPI^LGIIPYAGIDIAVYETLKNAWLQHYAWSADPGVFVLLACGTMS 
GQLiiSYPLALVRTRMQAQAS IEGAPEVTMSSLFKHI LRTEGAFGLYRGLAPNBMKVIPAVS ISYV 
VYENLKITLGVQSR 

Important features: 
Signal peptide: 

amino acids 1-16 

Putative transmembrane domains: 

amino acids 284-304, 339-360, 376-394 

Mitochondrial energy transfer proteins signature. 

amino acids 206-215, 300-309 

N-glycosylation sites. 

amino acids 129-133, 169-173 

Elongation Factor-hand calcium-binding protein. 

amino acids 54-73, 85-104, 121-140 
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FIGURE 



GGAilGGCAGCGGCAGCTCCACTCAGCCAGTACCCAGATACGCTGGGAACCTTCCCCAGCCATOGC 
TTCCCTGGGGCAGATCCTCTTCTGGAGCATAATTAGCATCATCATTATTCTGGCTGGAGCAATTG 
CACTCATCAT T GGCT TTGGTAT TT CAGGGAGACACT CCAT CACAGT CACTACT GTCGCCT CAGCT 
GGGAACATTGGGGAGGATGGAATCCTGAGCTGCACTTTTGAACCTGACATCAAACTTTCTGATAT 
C G T ( 3AT AC AAT GGCT GAAG GAAGG T G T T T T AGGC T T GGT C CAT GAG T T C AAAGAAG GC AAAG AT G 
AGCTGTCGGAGCAGGAT GAAATGT T CAGAGGCCGGACAGCAGT GTT T GCTGATCAAGTGATAGT T 
GGCi^ATGCCTCTTTGCGGCTGAAAAACGTGCAACTCACAGATGCTGGCACCTACAAATGTTATAT 

CAT CAC T T CTAAAGGCAAGGGGAAT GCTAACC T TGAG TATAAAAC TGGAGCC T TCAGCATGCCGG 
AAGTGAATGTGGACTATAATGCCAGCTCAGAGACCTTGCGGTGTGAGGCTCCCCGATGGTTCCCC 

CAGCCCACAGTGGTC TGGGCAT CCCAAGT T GACCAGGGAGCCAACTT CT CGGAAGTC TCCAATAC 
CAGCTTTGAGCTGAACTCTGAGAATGTGACCATGAAGGTTGTGTCTGTGCTCTACAATGTTACGA 

TCAACAACACAT ACTC CT G TAT GAT TGAAAATGACAT T GCCAAAGCAACAGGGGATAT CAAAGTG 
ACAGAATCGGAGATCAAAAGGCGGAGTCACCTACAGCTGCTAAACTCAAAGGCTTCTCTGTGTGT 
CTCTTCTTTCTTTGCCATCAGCTGGGCACTTCTGCCTCTCAGCCCTTACCTGATGCTAAAATAAT 
GTGCCTTGGCC^CAAAAAAGCATGCAAAGTCATTGTTACAACAGGGATCTACAGAACTATTTCAC 
CACCAGATATGACCTAGTTTTATATTT CTGGGAGGAAATGAAT TCAT AT CTAGAAGTCTGGAGTG 
AGC AAACAAGAGCAAGAAACAAAAAGAAGCCAAAAGCAGAAGGC TCCAAT AT GAACAAGATAAAT 
CTAT CT TCAAAGACATAT TAGAAG T TGGGAAAATAAT T CATGTGAACTAGACAAGT GT GT TAAGA 
GTGATAAGTAAAATGCACGTGGAGACAAGTGCATCCCCAGATCTCAGGGACCTCCCCCTGCCTGT 
CACCTGGGGAGTGAGAGGACAGGATAGTGCATGTTCTTTGTCTCTGAATTTTTAGTTATATGTGC 

T G T AATGT T GCT CTGAGGAAGCCCCTGGAAAGTC TAT CCCAACATAT CCACATC T TATATTC CAC 
AA?LTTAAGCTGTAGTATGTACCCTAAGACGCTGCTAATTGACTGCCACTTCGCAACTCAGGGGCG 
GCTGCATTTTAGTAATGGGTCAAATGATTCACTTTTTATGATGCTTCCAAAGGTGCCTTGGCTTC 
TCTTCCCAACTGACAAATGCCAAAGTTGAGAAAAATGATCATAATTTTAGCATAAACAGAGCAGT 
CGGGGACACCGATTT TATAAATAAACT GAGCACC TTCTTTT TAAACAAAAAAAAAAAAAAAAAAA 
AA/iAAAAAAAAAAAAAAAA&AAAAAAAAAAAAA 



FIGURE 60 



MASLGQILFWSIISII IILAGAIALIIGFGISGRHSITVTTVASAGNIGEDGILSCTFEPDIKLS 
D I VI QWLKEGVLGLVHE FKEGKDELSE QDEM FRGRT AVFADQVI VGNAS LRLKNVQLTDAGT YKC 
Y 1 1 T S KGKGNANLE YKT GAFSMPEVNVDYNAS SETLRCEAPRW FPQPT WWAS QVDQGANFSEVS 
NTS FELNSENVTMKWSVL YNVT INNT YSCMIENDIAKATGDIKVTESE IKRRSHLOLLNSK&SL 
CVS SFFAI S WALL PLSPYLMLK 

Important features: 
Signal peptide: 

amino acids 1-28 

Transmembrane domain: 

amino acids 258-281 

N-glycosylation sites. 

amino acids 112-116, 160-164, 190-194, 196-200, 205-209, 216-220, 
220-224 

N-myristoylation sites. 

amino acids 52-58, 126-132, 188-194 
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T GACGT CAGAAT CAC CATGGC CAG C TAT CC T T AC C GGC AGGGC TGCC CAGGAGC T GCAGGAC AAG 
CACCAGGAGCCCCTCCGGGTAGCTACTACCCTGGACCCCCCAATAGTGGAGGGCAGTATGGTAGT 
GGGCTACCCCCTGGTGGTGGTTATGGGGGTCCTGCCCCTGGAGGGCCTTATGGACCACCAGCTGG 

T GGAGGGCCC TATGGACACCCCAAT CCTGGGATGT T CCCCTCT GGAACT CCAGGAGGACCATAT G 
GCGGTGCAGCTCCCGGGGGCCCCTATGGTCAGCCACCTCCAAGTTCCTACGGTGCCCAGCAGCCT 

GGGCTTTATGGACAGGGTGGCGCCCCTCCCAATGTGGATCCTGAGGCCTACTCCTGGTTCCAGTC 
GGTGGACTCAGATCACAGTGGCTATATCTCCATGAAGGAGCTAAAGCAGGCCCTGGTCAACTGCA 
ATTGGTCTTCATTCAATGATGAGACCTGCCTCATGATGATAAACATGTTTGACAAGACCAAGTCA 
GGCCGCATCGATGTCTACGGCTTCTCAGCCCTGTGGAAATTCATCCAGCAGTGGAAGAACCTCTT 
CCAGCAGTATGACCGGGACCGCTCGGGCTCCATTAGCTACACAGAGCTGCAGCAAGCTCTGTCCC 
AAATGGGCTACAACCTGAGCCCCCAGTTCACCCAGCTTCTGGTCTCCCGCTACTGCCCACGCTCT 
GCC.AATCCTGCCATGCAGCTTGACCGCTTCATCCAGGTGTGCACCCAGCTGCAGGTGCTGACAGA 
GGCCTTCCGGGAGAAGGACACAGCTGTACAAGGCAACATCCGGCTCAGCTTCGAGGACTTCGTCA 

C CATGACAGC T T CT CGGATGC TATGACCCAAC CATCTGT GGAGAGTGGAGT GCACCAGGGACCT T 
TCCTGGCTTCTTAGAGTGAGAGAAGTATGTGGACATCTCTTCTTTTCCTGTCCCTCTAGAAGAAC 
ATTCTCCCTTGCTTGATGCAACACTGTTCCAAAAGAGGGTGGAGAGTCCTGCATCATAGCCACCA 
AAT AGTGAGGACCGGGGCTGAGGCCACACAGATAGGGGCCTGAT GGAGGAGAGGATAGAAG T TGA 
ATGTCCTGATGGCCATGAGCAGTTGAGTGGCACAGCCTGGCACCAGGAGCAGGTCCTTGTAATGG 
AGTTAGTGTCCAGTCAGCTGAGCTCCACCCTGATGCCAGTGGTGAGTGTTCATCGGCCTGTTACC 
GTTAGTACCTGTGTTCCCTCACCAGGCCATCCTGTCAAACGAGCCCATTTTCTCCAAAGTGGAAT 
CTGACCAAGCATGAGAGAGATCT GTCTATGGGACCAGT GGCT TGGAT TCTGCCACACCCATAAAT 
CCTTGTGTGTTAACTTCTAGCTGCCTGGGGCTGGCCCTGCTCAGACAAATCTGCTCCCTGGGCAT 
CTTTGGCCAGGCTTCTGCCCCCTGCAGCTGGGACCCCTCACTTGCCTGCCATGCTCTGCTCGGCT 
TCAGTCTCCAGGAGACAGTGGTCACCTCTCCCTGCCAATACTTTTTTTAATTTGCATTTTTTTTC 
AT TTGGGGCCAAAAGT CCAGTGAAAT T GTAAGCT TCAATAAAAGGATGAAACTCTGA 
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MASYPYRQGCPGAAGQAPGAPPGSYYPGPPNSGGQYGSGLPPGGGYGGPAPGGPYGPPAGGGPYG 
HPNPGMFPSGTPGGPYGGAAPGGPYGQPPPSSYGAQQPGLYGQGGAPPNVDPEAYSWFQSVDSDH 
SGY I SMKELKQALVNCNWS S FNDE TCLMMINMFDKTKSGRIDVYGFSALWKFI QQWKNLFQQYDR 
DRSGSISYTELQQALSQMGYNLSPQFTQLLVSRYCPRSANPAMQLDRFIQVCTQLQVLTEAFREK 

DTAVQGNIRLS FEDFVTMTASRML 



O 

Wl amino acids 1-19 



3 = 



hf amino acids 135-138, 150-153, 202-205, 271-274 



i 5 ^ s 

y 

ill 



Important features of the protein: 
Signal peptide: 



N-glycosylation site. 

amino acids 147-150 

Casein kinase II phosphorylation sites. 



N-myristoylation sites. 

amino acids 9-14, 15-20, 19-24, 33-38, 34-39, 39-44, 43-48, 61- 
66, 70-75, 78-83, 83-88, 87-92, 110-115 
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|£2: is 



iri 
ill 



b I 



CAGGjATGCAGGGCCGCGTGGCAGGGAGCTGCGCTCCTCTGGGCCTGCTCCTGGTCTGTCTTCATC 

TCCCAGGCCTCTTTGCCCGGAGCATCGGTGTTGTGGAGGAGAAAGTTTCCCAAAACTTCGGGACC 

AACTTGCCTCAGCTCGGACAACCTTCCTCCACTGGCCCCTCTAACTCTGAACATCCGCAGCCCGC 

TCTGGACCCTAGGTCTAATGACTTGGCAAGGGTTCCTCTGAAGCTCAGCGTGCCTCCATCAGATG 

GCTTCCCACCTGCAGGAGGTTCTGCAGTGCAGAGGTGGCCTCCATCGTGGGGGCTGCCTGCCATG 

GATTCCTGGCCCCCTGAGGATCCTTGGCAGATGATGGCTGCTGCGGCTGAGGACCGCCTGGGGGA 

AGCGCTGCCTGAAGAACTCTCTTACCTCTCCAGTGCTGCGGCCCTCGCTCCGGGCAGTGGCCCTT 

TGCCTGGGGAGTCTTCTCCCGATGCCACAGGCCTCTCACCTGAGGCTTCACTCCTCCACCAGGAC 

TCGGAGTCCAGACGACTGCCCCGTTCTAATTCACTGGGAGCCGGGGGAAAAATCCTTTCCCAACG 

CCCTCCCTGGTCTCTCATCCACAGGGTTCTGCCTGATCACCCCTGGGGTACCCTGAATCCCAGTG 

TGTCCTGGGGAGGTGGAGGCCCTGGGACTGGTTGGGGAACGAGGCCCATGCCACACCCTGAGGGA 

ATCTGGGGTATCAATAATCAACCCCCAGGTACCAGCTGGGGAAATATTAATCGGTATCCAGGAGG 

CAGCTGGGGAAATATTAATCGGTATCCAGGAGGCAGCTGGGGGAATATTAATCGGTATCCAGGAG 

GCAGCTGGGGGAATATTCATCTATACCCAGGTATCAATAACCCATTTCCTCCTGGAGTTCTCCGC 

CCTCCTGGCTCTTCTTGGAACATCCCAGCTGGCTTCCCTAATCCTCCAAGCCCTAGGTTGCAGTG 

GGGCTAGAGCACGATAGAGGGAAACCCAACATTGGGAGTTAGAGTCCTGCTCCCGCCCCTTGCTG 

TGTGGGCTCAATCCAGGCCCTGTTAACATGTTTCCAGCACTATCCCCACTTTTCAGTGCCTCCCC 

TGCTCATCTCCAATAAAATAAAAGCACTTATG 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA?yW\AAAAAAAAAAAAAA 
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MQGRVAGSCAPLGLLLVCLHLPGLFARSIGWEEKVSQNFGTNLPQLGQPSSTGPSNSEHPQPAL 
DPRSNDLARVPLKLSVPPSDGFPPAGGSAVQRWPPSWGLPAMDSWPPEDPWQMMAAAAEDRLGEA 
LPEELSYLSSAAALAPGSGPLPGESSPDATGLSPEASLLHQDSESRRLPRSNSLGAGGKILSQRP 
PWSLIHRVLPDHPWGTLNPSVSWGGGGPGTGWGTRPMPHPEGIWGINNQPPGTSWGNINRYPGGS 
WGNINRYPGGSWGNINRYPGGSWGNIHLYPGINNPFPPGVLRPPGSSWNIPAGFPNPPSPRLQWG 

Important features of the protein: 
Signal peptide: 

amino acids 1-26 

Casein kinase II phosphorylation sites. 

amino acids 56-59, 155-158 

N-myristoylation sites . 

amino acids 48-53, 220-225, 221-226, 224-229, 247-252, 258-263, 
259-264, 269-274, 270-275, 280-285, 281-286, 305-310 
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FIGURE 6s 



AAGGAGAGGCCACCGGGACTTCAGTGTCTCCTCCATCCCAGGAGCGCAGTGGCCACTATGGGGTC 
TGGGCTGCCCCTTGTCCTCCTCTTGACCCTCCTTGGCAGCTCACATGGAACAGGGCCGGGTATGA 
CTTTGCAACTGAAGCTGAAGGAGTCTTTTCTGACAAATTCCTCCTATGAGTCCAGCTTCCTGGAA 
TTGCTTGAAAAGCTCTGCCTCCTCCTCCATCTCCCTTCAGGGACCAGCGTCACCCTCCACCATGC 
AAGATCTCAACACCATGTTGTCTGCAACACATGACAGCCATTGAAGCCTGTGTCCTTCTTGGCCC 
GGGCTTTTGGGCCGGGGATGCAGGAGGCAGGCCCCGACCCTGTCTTTCAGCAGGCCCCCACCCTC 
CTGAGTGGCAATAAATAAAATTCGGTATGCTG 
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FIGURE 66 

MGSGLPLVLLLTLLGSSHGTGPGMTLQLKLKESFLTNSSYESSFLELLEKLCLLLHLPSGTSVTL 
HHARSQHHWCNT 

Important features: 
Signal peptide: 

amino acids 1-19 

N-glycosylation site, 

amino acids 37-41 

N-myr i s toyl ati on s i tes . 

amino acids 15-21, 19-25, 60-66 
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FIGURE 67 



acgg;accgagggttcgagggagggacacggaccaggaacctgagctaggtcaaagacgcccgggc 

CAGGTGCCCCGTCGCAGGTGCCCCTGGCCGGAGATGCGGTAGGAGGGGCGAGCGCGAGAAGCCCC 
TTCCTCGGCGCTGCCAACCCGCCACCCAGCCCATGGCGAACCCCGGGCTGGGGCTGCTTCTGGCG 
CTGGGCCTGCCGTTCCTGCTGGCCCGCTGGGGCCGAGCCTGGGGGCAAATACAGACCACTTCTGC 
AAATGAGAATAGCACTGTTTTGCCTTCATCCACCAGCTCCAGCTCCGATGGCAACCTGCGTCCGG 
AAGCCATCACTGCTATCATCGTGGTCTTCTCCCTCTTGGCTGCCTTGCTCCTGGCTGTGGGGCTG 
GCACTGTTGGTGCGGAAGCTTCGGGAGAAGCGGCAGACGGAGGGCACCTACCGGCCCAGTAGCGA 

GGAGCAGTT CT CCCATGCAGCC GAGGCCC GGGCCC CT CAGGACTCCAAGGAGAC GGT GCAGGGCT 
GCCTGCCCATCTAGGTCCCCTCTCCTGCATCTGTCTCCCTTCATTGCTGTGTGACCTTGGGGAAA 
GGCAGTGCCCTCTCTGGGCAGTCAGATCCACCCAGTGCTTAATAGCAGGGAAGAAGGTACTTCAA 
AGACTCTGCCCCTGAGGTCAAGAGAGGATGGGGCTATTCACTTTTATATATTTATATAAAATTAG 
TAGTGAGAT G TAAAAAAAAAAAAAAAAAA 
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FIGURE 68 

MANPGLGLLLALGLP FLLARWGRAWGQ I QT T S ANENST VLPS S TS S S S DGNLRPEAI T AI I WFS 
LLAALLLAVGLAiLVRK^ 

Important features: 
Signal peptide: 
amino acids 1-19 

Transmembrane domain: 

amino acids 56-80 

N-glycosylation site. 

amino acids 36-40 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 86-90 

Tyrosine kinase phosphorylation site. 

amino acids 86-94 

N-myristoylation sites, 
amino acids 7-13, 26-32 
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FIGURE 6q 



5: Sir 



GCCAGGAAT AACTAGAGAGGAACfiATgGGGTTATTCAGAGGTTTTGTTTT CCTCTTAGTT CTGTGCCT GCTGCACGAG 
TCAA AT ACTT CCT T CATTAAGCT GAAT AATAAT GGCTT T GAAGAT AT T GT CAT TGTT ATAGATC CTAGTGT GCCAGAA 
GATGAAAAAATAAT TGAACAAAT AGAGGATAT GGT GACT ACAGCT T CTACGTACCTGTTT GAAGCCACAGAAAAAAGA 
TT TTTTTT CAAAAATGT ATCTATATTAAT TCCTGAGAATT GGAAGGAAAATC CTCAGTACAAAAGGCCAAAACAT GAA 
AACC ATAAACAT GCT GATGT TAT AG TT GCAC CACCTACACTC CCAGGTAGAGATGAACCAT ACACCAAGCAGT T CACA 
GAAT GT GGAGAGAAAGGCGAATACATT CACTT CAC CCCTGAC CTT CTACT TGGAAAAAAACAAAATGAATAT GGACCA 
CCAG GCAAACT GTTT GT CCAT GAGTGGGCTCACCT CCGGTGGGGAGT GTT T GATGAGTACAATGAAGATCAGCCTTTC 
TACCGT GCTAAGT CAAAAAAAAT CGAAGCAACAAGGTGTTCCGCAGGTAT CTCTGGTAGAAATAGAGTTTATAAGTGT 
CAAGGAGGCAGCT GTCTTAGTAGAGCATGCAGAATTGATTCTACAACAAAACT GTAT GGAAAAGATTGTCAATT CTTT 
CCT G ATAAAGTACAAACAGAAAAAGCAT CCAT AAT GTT T ATGCAAAGTATTGATT CT GTT GTTGAATT TTGT AAC GAA 
AAAA£CCATAATCAAGAAGCTCCAAGCCTACAAAACATAAAGTGCAATTTTAGAAGTACAT GGGAGGT GATTAGCAAT 
T CT G AGGATTTTAAAAACAC CAT ACCCAT GGT GACACCACCT CCT CCACCT GT CTT CTCATT GCTGAAGAT CAGT CAA 
AGAATTGTGT GCT TAGT TCTT GATAAGT CTGGAAGCAT GGGGG GT AAGGACCG CCT AAAT C GAATGAATCAAGCAGCA 
AAACATTTCCTGCTGCAGACTGTTGAAAATGGATCCTGGGTGGGGATGGTTCACTTTGATAGTACTGCCACTATTGTA 
AATAAGCT AAT CCAAAT AAAAAGCAGT GATGAAAGAAACACACTCAT GGCAGGATT ACCT ACATAT C CTCT GGGAGGA 
01 ACTTCCATCT GCT CT GGAAT T AAATAT GCAT T TCAGGT GATT GGAGAGCTACATT C CCAACTCGAT GGAT CCGAAGTA 

*[l CT GCT GCT GACTGATGGGGAGGATAACACTGCAAGTTCTTGT ATT GATGAAGT GAAACAAAGTGGGGCCAT T GTT CAT 

W TTTATT GCTT TGG GAAGAGCT G CTGAT GAAGCAGTAATAGAGATGAGCAAGATAACAGGAGGAAGTCATTT TTAT GTT 

Ifl TCAGAT GAAGCTCAGAACAAT GGCCT CATT GAT GCT TT T G GGGCT CTTACATCAG GAAATACT GATCT CTC CCAGAAG 

S *H T C CC'TT CAGCTCGAAAGTAAGGGATTAACACT GAATAGTAAT GCCT GGAT GAACGAC ACT GTCATAATT GAT AGT ACA 

W« GT GGGAAAGGACACGTTCTTTCT CATCACAT GGAACAGTCTGCCT CCCAGTATTTCTCTCTGGGATCCCAGTGGAACA 

Ll! AT AAT GGAAAATT TCACAGT GGAT GCAACTT CCAAAAT GGCCTAT CTCAGTATTCCAGGAACTGCAAAGGT GGGCACT 

TGGC5CATACAATCTT CAAGCCAAAGCGAACC CAGAAACATTAACTATTACAGTAACTTCT C GAGCAGCAAAT TCTT CT 
" GTGCCT CCAATCACAGTGAATGCTAAAATGAATAAGGACGTAAACAGTTT CCCCAGCCCAATGATTGTTTACGCAGAA 

Q ATT CTACAAGGAT AT GT ACCT GT T CTTGGAGCCAATGT GACT GCTT TCATTGAAT CACAGAATG GAC ATACAGAAGTT 

| p TT GGAACTT T TGGATAATGGT G CAGGC GCT GATT CTTT CAAGAAT GAT GGAGT CTACTCCAGGTATTT TACAG CAT AT 

ACAGAAAAT GGCAGATATAGCTTAAAAGTTCGGGCTCAT GGAGGAGCAAACACTGCCAGGCTAAAATTACGGCCT CCA 
Q CTGAAT AGAGCCGCGT ACAT ACCAGGCT G GGT AGT GAACGGGGAAATTGAAGCAAACCC GCCAAGACCTGAAATT GAT 

m\ GAGC3AT ACTCAGACCACCTT GGAG GATTT CAGCCGAACAGCATCCG GAGGT GCATT T GT GGTAT CACAAGT CCCAAGC 

CTTCXCTT GCCTGACCAATACCCACCAAGTCAAAT CACAGACCTT GAT GC CACAGTT CAT GAGGATAAGATTATT CTT 
Q AC AT GGACAGCAC CAGGAGAT AATT TT GATGTTGGAAAAGTT CAAC GTT ATAT CAT AAGAATAAGTGCAAGT ATT CTT 

iu gat(:taagagacagttttgatgatgctctt<^gtaaatactactgatctgtcaccaaaggaggccaactccaaggaa 
• - agctttgcatttaaaccagaaaatatctcagaagaaaatgcaacccacatatttattgccattaaaagtatagataaa 

agcaatttgacatcaaaagtatccaacattgcacaagt^ 

cctacacctactcctactcctactcctactcctgataaaagtcataattctggagttaatatttctacgctggtattg 
tctgtgatt gggt ctgttgtaattgttaactttattttaagtaccaccatttgaaccttaacgaagaaaaaaatcttc 
aagtagacctagaagagagttttaaaaaac^aaacaatgtaagtaaaggatatttctgaatcttaaaattcatcccat 
gtgtgatcataaactcataaaaataattttaagatgtcggaaaaggatactttgattaaataaaaacactcatggata 

T GTiXAAAACT GTCAAGAT TAAAATTTAATAGTTT CATT T ATTTGT T ATTTTATTT GTAAGAAAT AGT GATGAAC AAAG 

atcctttttcatactgatacctggttgtatattatttgatgcaacagttttctgaaatgatatttcaaattgcatcaa 

GAAAT T AAAATCAT CTAT CT GAGTAGT CAAAATACAAGTAAAG GAGAGCAAATAAACAACATTT GGAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 70 



MGLFRGFVFLLVLCLLHQSNTSFIKLNNNGFEDIVIVIDPSVPEDEKIIEQIEDMVTTASTYLFE 
ATEI<RFFFKNVSILIPEOTKENPQYKRPKHENHKHADVIVAPPTLPGRDEPYTKQFTECGEKGEY 
IHFTPDLLLGKKQNEYGPPGKLFVHEWAHLRWGVFDEYNEDQPFYRAKSKKIEATRCSAGISGRN 
RVYKCQGGSCLSRACRIDSTTKLYGKDCQFFPDKVQTEKAS IMFMQS IDS WEFCNEKTHNQEAP 
SLQNIKCNFRSTWEVI SNSEDFKNTIPMVTPPPPPVFSLLKI SQRIVCLVLDKSGSMGGKDRLNR 
MNQAAKHFLLQTVENGSWVGMVOT^ 

YAFQVIGELHSQLDGSEVLLLTDGEDNTASSCIDEVKQSGAIVHFIALGRAADEAVIEMSKITGG 
SHFYVSDEAQNNGLIDAFGALTSGNTDLSQKSLQLESKGLTLNSNAWMNDTVIIDSTVGKDTFFL 
ITWISTSLPPSISLWDPSGTIMENFTVDATSKMAYLSIPGTAKVGTWAYNLQAKANPETLTITVTS^ 
AANSSVPPITVNAKMNKDVNSFPSPMIVYAEILQGYVPVLGANVTAFIESQNGHTEVLELLDNGA 
GADSFKNDGWSRYFTAYTENGRYSLKVRAHGGANTARLKLRPPLNRAAYIPGWVVNGEIEANPP 
RPEIDEDTQTTLEDFSRTASGGAFYVSQVPSLPLPDQYPPSQITDLDATVHEDKIILTWTAPGDN 

FDVGKVQRYI IRI SAS ILDLRDS FDDALQVNT TDLS PKEANSKES FAFKPENI SEENATHI FI AI 
KS I DKSNLT SKVSNIAQVTLFI PQANPDD I DPT PTPTPT PT PDKSHNSGVN I S TLVLSVI GS WI 
VNFILSTTI 

Signal peptide: 

amino acids 1-21 

Putative transmembrane domains: 

amino acids 284-300, 617-633 

Leucine zipper pattern* 

amino acids 469-491, 476-498 

N-glycosylation site. 

amino acids 20-24, 75-79, 340-344, 504-508, 542-546, 588-592, 
628-632, 811-815, 832-836, 837-841, 852-856, 896-900 



FIGURE 71 

CT CCT T AGGT GGAAACCCTG GGAGTAGAGTACT GACAGCAAAGAC CGGGAAAGACCATAC GTCCCCGGGCAGGGGT GA 
CAACAGGTGTCATCTTTTTGATCTCGTGTGTGGCTGCCTTCCTATTTCAAGGAAAGACGCCAAGGTAATTTTGACCCA 
GAGGAGCAAT GAT GTAGCCACCT CCTAACCTT CCCT T CT TGAACCCCCAGTTATGCCAGGAT TT ACT AGAGAGT GT CA 
ACTC AACCAGCAAGCGGCT CCT T CGGCTTAACTT GTGGTT GGAGGAGAGAACCTTTGT GGGGCTGCGTTCT CT T AGCA 
GTGCTCAGAAGTGACTTGCCT GAGGGT GGACCAGAAGAAAGGAAAGGTCCCCT CTTGCTGTTGGCTGCACATCAGGAA 
GGCT GT GAT GGGAAT GAAGGT GAAAACTTGGAGAT TTCACTT CAGT CATT GCTTCT GCCT GCAAGAT CATC CT TT AAA 
AGTAGAGAAGCT GCTCT GTGT GGTGGTTAACT CCAAGAGGCAGAACTCGT T CTAGAAGGAAATGGAT GCAAGCAGCTC 
CGGGGGCCCCAAACGCATGCTTCCTGTGGTCTAGCCCAGGGAAGCCCTTCCGTGGGGGCCCCGGCTTTGAGGGATGCC 
ACCGGTTCTGGACGCATGGCTGATTCCTGAMGATGATGGTTCGCCGGGGGCTGCTTGCGTGGATTTCCCGGGTGGTG 
GTTTTGCTGGTGCTCCTCTGCTGTGCTATCTCTGTCCTGTACATGTTGGCCTGCACCCCAAAAGGTGACGAGGAGCAG 
CTGGCACTGCCCAGGGCCAACAGCCCCACGGGGAAGGAGGGGTACCAGGCCGTCCTTCAGGA 

CGCAACTACGTGAGCAGCCT GAAGCGGCAGAT CGCACAGCTCAAGGAGGAGCT GCAGGAGAGGAGTGAGCAGCT CAGG 
AATGGGCAGTACCAAGCCAGCGATGCTGCTGGCCTGGGTCTGGACAGGAGCCCCCCAGAGAAAACCCAGGCCGACCTC 
CT GGCCTTCCT GCACT C GCAGGT GGAC AAGGCAGAGGT GAAT GCT GGCGT CAAGCT GGC CACAGAGT AT GCAGCAGTG 
CCTTT C GAT AGCTTTACTCTACAGAAGGTGT AC CAGCT GGAGACT GGCCTTACCCGCCAC CCCGAGGAGAAGC CT GTG 
AGGPAGGACAAGCGGGATGAGTTGGTGGAAGCCATTGAATCAGCCTTGGAGACCCTGAACAATCCTGCAGAGAACAGC 
C CCAAT CACCGTCCTTACACGGC CTCT GATTT CAT AGAAGGGATCT AC CGAACAGAAAGGGACAAAGGGACAT T GTAT 
GAGCT CACCTTCAAAGGGGACCACAAACACGAATT CAAACGGCT CATCTT ATT TCGACCAT TCAGC CCCAT CATGAAA 
GT GAAAAATGAAAAGCTCAACAT GGCCAACACGCTTATCAATGTTATCGTGCCTCTAGCAAAAAGGGT GGACAAGTTC 
CGGCJAGTTCAT GC AGAAT TT CAGGGAGAT GT GCAT TGAGCAGGAT GGGAGAGT CCATCTCACTGT TGTTTACTTT GGG 
AAAGiAAGAAATAAATGAAGT CAAAGGAATACTTGAAAACACTT CCAAAGCT GCCAACTT CAGGAACTT TACCT T CATC 
CAGCT GAATGGAGAATTTTCT CGGGGAAAGGGACTTGATGTTGGAGCCCGCTTCTGGAAGGGAAGCAACGTCCTT CTC 
TTTTT CTGT GATGT GGACAT CTACTTCACAT CTGAATT CCT CAATACGT GTAGGCT GAAT ACACAGC CAGGGAAGAAG 
GTATTT TAT CCAGTTCTTTT CAGT CAGT ACAATCCTGGCATAATATAC GGCCACCAT GAT GCAGTC CCTCC CTT GGAA 
CAGCAGCT GGTCATAAAGAAGGAAACTGGAT TTT GGAGAGACTTT GGATT T GGGAT GACGT GTCAGT ATC GGT CAGAC 
TT CAT CAAT ATAGGT GGGTTT GAT CT GGACAT CAAAGGCT GGGGCGGAGAGGAT GT GCAC CTTTATCGCAAGTAT CTC 
CACAGCAAC CT CATAGT GGT ACGGACGCCT GT GC GAGGACTCTTC CACCT CTGGCATGAGAAGC GCT GCAT GGACGAG 
CT^CCCCCGAGCAGTACAAGATGTGCATGCAGTCCAAGGCCATGAACGAGGCATCCCACGGCCAGCTGGGCATGCTG 
GTGTT CAGGCACGAGATAGAGGCT CACCT T CGCAAACAGAAACAGAAGACAAGTAGCAAAAAAACATGAACTCCCAGA 
GAAGGATTGT GGGAGACACT TTT T CTTT CCTTTT GCAAT TACTGAAAGT GGCT GCAACAGAGAAAAGACTT CCAT AAA 
GGACGACAAAAGAATT GGACT GAT GGGT CAGAGAT GAGAAAGCCTC CGATTTCTCT CTGT T G GGCTTTTTACAACAGA 
AATCAAAATCTCCGCTTTGCCTGCAAAAGTAACCCAGTT GCACCCTGTGAAGT GTCTGACAAAGGCAGAATGCTT GTG 
AGATTATAAGCCT AAT GGTGT GGAGGTTT TGATGGT GTTTAC AAT ACACT GAGACCT GTTGTTTTGT GTGCT CATT GA 
AATATTCAT GATTTAAGAGCAGT TTTGT AAAAAATT CATTAGCAT GAAAGGCAAGCATATT TCT CCT CATAT GAAT GA 
GCCTAT CAGCAGGGCT CTAGTTT CTAGGAAT GCTAAAATAT CAGAAGGCAGGAGAGGAGAT AGGCTTAT TAT GATACT 
AGTGAGTAC AT TAAGT AAAAT AAAATGGACCAGAAAAGAAAAGAAACCATAAATAT C GTGT CATATTT TCCCCAAGAT 
TAACCAAAAATAATCTGCTTATCTTTTTGGTTGTCCTTTTAACTGTCTCCGTTTTTTTCTTTTATTTAAAAATGCACT 
TTTTTT CCCTTGT GAGTTATAGT CTGCT TATTTAATTAC CACT TT GCAAGCCT TAC AAGAGAGCACAAGTT GGCCT AC 
AT TTTT ATATTTT TTAAGAAGATACTTT GAGATGCATT ATGAGAACTTT CAGTTCAAAGCAT CAAAT T GAT GCCATAT 
C CA^GGACATGC CAAAT GCT GATT CTGT CAGGCACTGAATGT CAGGCAT T GAGACAT AGGGAAGGAAT GGT TT GTACT 
AATACAGACGTACAGATACTTT CT CT GAAGAGTATT TT CGAAGAGGAGCAACT GAACACT GGAGGAAAAGAAAATGAC 
ACTTTCTGCTTTACAGAAAAGGAAACTCATTC^GACTGGTGATATCGTGATGTACCTAAAAGTC^GAAACCACATTTT 
CT C CTCAGAAGTAGGGACCGCTT T CT T AC CT GTTTAAATAAACCAAAGT ATAC CGT GTGAACCAAACAATCT CT TTTC 
AAAkCAGGGTGCT C CT C CTGGCTT CTGGCTT CCATAAGAAGAAAT GGAGAAAAATAT ATAT ATATATATAT ATATTGT 
GAAA.GAT CAATCCATCT GCCAGAATCT AGTGG GAT GGAAGTTT TT GCTACAT GTTAT CCACCCCAG GCCAGGT GGAAG 
TAACT GAATTATTT TTT AAAT TAAGCAGTTCT ACTCAAT CAC CAAGATGCTT CTGAAAATT GCATTT T ATTACCAT TT 
CAA ACTATT TTTTAAAAATAAATACAGTTAACATAGAGT GGTT TCT TCATTCAT GT GAAAAT TATTAGCCAGCACCAG 
AT GCAT GAGCTAATTAT CT CTTTGAGT CCTT GCT TCT GTTT GCT CACAGTAAACTCATTGTT TAAAAGCTT CAAGAAC 
ATTCAAGCTGTTGGTGTGTTAAAAAATGCATT GTATTGATTT GTACTGGTAGTTTAT GAAATTTAATTAAAACACAGG 
CCATGAATGGAAGGTGGTATT GCACAGCTAATAAAATATGATTTGT GGATATGAA 



FIGURE 72 



M^mRGLIAWISRVVVLLVLLCCAISVLYMIACTPKGDEEQLALPRANSPTGKEGYQAVLQEWE 

EQHimYVSSLKRQIAQLKEELQERSEQLRNGQYQASDAAGLGLDRSPPEKTQADLLAFLHSQVDK 

AEVNAGVKLATE YAAVPFDS FT LQKVYQLE TGLTRHPEEKPVRKDKRDELVEAIESALETLNNPA 

ENS PNHRPYTASDFI EG I YRTERDKGTL YELT FKGDHKHE FKRLI L FRP FSP IMKVKNEKLNMAN 

TLINVIVPLAKRVDKFRQFMQNFREMCIEQDGRVHLTVVYFGKEEINEVK 

TFI QLNGE FSRGKGLDVGARFWKGSNVLLFFCDVDI YFT SE FLNTCRLNTQPGKKVFYPVLFSQY 

NPGIIYGHHDAVPPLEQQLVIKKETGFWRDFGFGMTCQYRSDFINIGGFDLDIKGWGGEDVHLYR 

KYLHSNL I WRT PVRGL FHLWHEKRCMDELT PEQYKMCMQSKAMNE ASHGQLGMLVFRHE I EAHL 

RKQKQKTS SKKT 

Important features: 
Signal peptide : 

amino acids 1-27 

N-glycosylation sites. 

amino acids 315-319, 324-328 

N-myristoylation sites. 

amino acids 96-102, 136-142, 212-218, 311-317, 339-345, 393-399 

Amidation site. 

amino acids 377-381 
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FIGURE 73 

GAGACT GCAGAGGGAGATAAAGAGAGAGGGCAAAGAGGCAGCAAGAGAT T TGTCCT GGGGATCCA 
GAAi^CCCATGATACCCTACTGAACACCGAATCCCCTGGAAGCCCACAGAGACAGAGACAGCAAGA 
GAAGCAGAGATAAATACACT CACGCCAGGAGCTCGCT CGCTCT C TC T C TC T CT CTCTCACTCC TC 
CCTCCCTCTCTCTCTGCCTGTCCTAGTCCTCTAGTCCTCAAATTCCCAGTCCCCTGCACCCCTTC 
CTGGGACACTATGTTGTTCTCCGCCCTCCTGCTGGAGGTGATTTGGATCCTGGCTGCAGATGGGG 
GTC^CACTGGACGTATGAGGGCCCACATGGTCAGGACCATTGGCCAGCCTCTTACCCTGAGTGT 
GGAAACAATGCCCAGTCGCCCATCGATAT TCAGACAGACAGTGTGACAT T T GAC CCT GAT T T GCC 
TGCTCTGCAGCCCCACGGATATGACCAGCCTGGCACCGAGCCTTTGGACCTGCACAACAATGGCC 
ACACAGTGCAACTCTCTCTGCCCTCTACCCTGTATCTGGGTGGACTTCCCCGAAAATATGTAGCT 
GCCCAGCTCCACCTGCACTGGGGTCAGAAAGGATCCCCAGGGGGGTCAGAACACCAGATCAACAG 
TGAAGCCACATTTGCAGAGCTCCACATTGTACATTATGACTCTGATTCCTATGACAGCTTGAGTG 
AGGCTGCTGAGAGGCCTCAGGGCCTGGCTGTCCTGGGCATCCTAATTGAGGTGGGTGAGACTAAG 
AATATAGCT TATGAACACATTCTGAGTCACTTGCATGAAGTCAGGCATAAAGAT CAGAAGACCTC 
AGTGCCTCCCTTCAACCTAAGAGAGCTGCTCCCCAAACAGCTGGGGCAGTACTTCCGCTACAATG 
GCTCGCTCACAACTCCCCCTTGCTACCAGAGTGTGCTCTGGACAGTTTTTTATAGAAGGTCCCAG 
ATTTCAATGGAACAGCTGGAAAAGCTTCAGGGGACATTGTTCTCCACAGAAGAGGAGCCCTCTAA 
GCTTCTGGTACAGAACTACCGAGCCCTTCAGCCTCTCAATCAGCGCATGGTCTTTGCTTCTTTCA 
TCCAAGCAGGATCCTCGTATACCACAGGTGAAATGCTGAGTCTAGGTGTAGGAATCTTGGTTGGC 
TGT CT CT GCCT TCTCCTGGCTGT TTATT TCAT TGCTAGAAAGAT TCGGAAGAAGAGGCTGGAAAA 
CCG AAAGAGTGTGGT C T T CACCTCAGCACAAGCCACGACT GAGGCATAAAT TCCT T CT CAGATAC 
CATGGATGTGGATGACTTCCCTTCATGCCTATCAGGAAGCCTCTAAAATGGGGTGTAGGATCTGG 
CCAGAAACACTGTAGGAGTAGTAAGCAGATGTCCTCCTTCCCCTGGACATCTCTTAGAGAGGAAT 
GGACCCAGGCTGTCATTCCAGGAAGAACTGCAGAGCCTTCAGCCTCTCCAAACATGTAGGAGGAA 
ATGAGGAAATCGCTGTGTTGTTAATGCAGAGANCAAACTCTGTTTAGTTGCAGGGGAAGTTTGGG 
ATATACCCCAAAGTCCTCTACCCCCTCACTTTTATGGCCCTTTCCCTAGATATACTGCGGGATCT 
C TCCTTAGGATAAAGAGT TGCTGTTGAAGT TGTATATT TT TGATCAATATAT TT GGAAAT TAAAG 
TTTCTGACTTT 



FIGURE 74 

ML FSALLLEVT W I LAADGGQH WT YEGPHGQDHWPAS YPE CGNNAQS P I D I QTDSVT FDPDLPALQ 
PEGYDQPGTEPLDLHNNGHTVQLSLPS TLYLGGLPRKY VaAQLHLHWGQKGS PGGSEHQINSEAT 
FAELHIVHYDSDSYDSLSEAAERPQGLAVLGILIEVGETKNIAYEHILSHLHEVRHKDQKTSVPP 
FNLRELLPKQLGQYFRYNGSLTTPPCYQSVLWTVFYRRSQISMEQLEKLQGTLFSTEEEPSKLLV 
QNYRALQPLNQRMVFASFIQAGSSYTTGEMLSLGVGILVGCLCLLLAVYFIARKIRKKRLENRKS 

WFT S AQAT TEA 

Important features of the protein: 
Signal peptide: 

amino acids 1-15 

Transmembrane domain: 

amino acids 291-310 

N-glycosylation site. 

amino acids 213-216 

Eukaryotic-type carbonic anhydrases proteins 

amino acids 197-245, 104-140, 22-69 
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FIGURE 75 



TGCCGCTGCCGCCGCTGCTGCTGTTGCTCCTGGCGGCGCCTTGGGGACGGGCAGTTCCCTGTGTC 
TCTGGTGGTTTGCCTAAACCTGCAAACATCACCTTCTTATCCATCAACATGAAGAATGTCCTACA 
ATGGACT CCACCAGAGGGTCT TCAAGGAGTTAAAGTTACT TACACTGTGCAGTATTTCATCACAA 
ATTGGCCCACCAGAGGTGGCACTGACTACAGATGAGAAGTCCATTTCTGTTGTCCTGACAGCTCC 
AGAGAAGT GGAAGAGAAATCCAGAAGACC T TCC TGT T TCCATGCAACAAATATAC T CCAAT CTGA 
AGTATAACGTGTCTGTGTTGAATACTAAATCAAACAGAACGTGGTCCCAGTGTGTGACCAACCAC 
H ACGCTGGTGCTCACCTGGCTGGAGCCGAACACTCTTTACTGCGTACACGTGGAGTCCTTCGTCCC 
JJJ AGGGCCCCCTCGCCGTGCTCAGCCTTCTGAGAAGCAGTGTGCCAGGACTTTGAAAGATCAATCAT 
Jp CAGAGTTCAAGGCTAAAATCATCTTCTGGTATGTTTTGCCCATATCTATTACCGTGTTTCTTTTT 
||| T CT G T GATGGGC TAT TCCAT CTACCGATATATC CACGT T GGCAAAGAGAAACAC CCAGCAAATTT 

III GATTTTGATTTATGGAAATGAATTTGACAAAAGATTCTTTGTGCCTGCTGAAAAAATCGTGATTA 

IJI ACTTTATCACCCTCAATATCTCGGATGATTCTAAAATTTCTCATCAGGATATGAGTTTACTGGGA 

hi 

AAAAGCAGTGATGTATCCAGCCTTAATGATCCTCAGCCCAGCGGGAACCTGAGGCCCCCTCAGGA 

,p| GGAAGAGGAGGTGAAACATTTAGGGTATGCTTCGCATTTGATGGAAATTTTTTGTGACTCTGAAG 

Iff AAAACAC GGAAGGTAC T TCTCTCACCCAGCAAGAGT CCC TCAGCAGAACAATAC CC CCGGATAAA 

Cl ACAGT CAT T GAATAT GAATATGATGTCAGAACCAC TGACAT T T GT GCGGGGCCT GAAGAGCAGGA 

5 ^ GCT CAGT T T GCAGGAGGAGGT GT CCACACAAGGAACAT T AT TGGAGTCGCAGGCAGCGT T GGCAG 



TCTTGGGCCCGCAAACGTTACAGTACTCATACACCCCTCAGCTCCAAGACTTAGACCCCCTGGCG 
CAGGAGCACACAGACTCGGAGGAGGGGCC GGAGGAAGAGCCAT CGACGACCC TGG T CGACTGGGA 
TCCCCAAACTGGCAGGCTGTGTATTCCTTCGCTGTCCAGCTTCGACCAGGATTCAGAGGGCTGCG 
AGCCTTCTGAGGGGGATGGGCTCGGAGAGGAGGGTCTTCTATCTAGACTCTATGAGGAGCCGGCT 
CCAGACAGGCCACCAGGAGAAAAT GAAACCTATCTCATGCAATTCATGGAGGAATGGGGGT TATA 
TGTGCAGATGGAAAACTGATGCCAACACTTCCTTTTGCCTTTTGTTTCCTGTGCAAACAAGTGAG 
TCACCCCTTTGATCCCAGCCATAAAGTACCTGGGATGAAAGAAGTTTTTTCCAGTTTGTCAGTGT 
CTGTGAGAATTACTTATTTCTTTTCTCTATTCTCATAGCACGTGTGTGATTGGTTCATGCATGTA 
GGTCTCTTAACAATGATGGTGGGCCTCTGGAGTCCAGGGGCTGGCCGGTTGTTCTATGCAGAGAA 
AGCAGTCAATAAATGTTTGCCAGACTGGGTGCAGAATTTATTCAGGTGGGTGT 
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FIGURE 76 

MSTOGLHQRVFKELKLLTLCSISSQIGPPEVALTTDEKSISWLTAPEKWKRNPEDLPVSMQQIY 
SNLKYNVSVLNTKSNRTWSQCVTNHTLVLTWLEPNTLYCVHVESFVTGPPRi^QPSEKQCARTLK 
DQS SE FKAKI I FW YVL P I S I T VFLFSVMGYS I YRYI HVGKEKHPANL IL I YGNE FDKRFFVPAEK 
IVIlSfFITLNISDDSKISHQDMSLLGKSSDVSSLNDPQPSGNLRPPQEEEEVKHLGYASHLMEIFC 
DSESNTEGTSLTQQESLSRTIPPDKTVIEYEYDVRTTDICAGPEEQELSLQEEVSTQGTLLESQA 
ALA^LGPQTLQYSYTPQLQDLDPLAQEHTDSEEGPEEEPSTTLVDWDPQTGRLCIPSLSSFDQDS 
EGCEPSEGDGLGEEGLLSRLYEEPAPDRPPGENETYLMQFMEEWGLYVQMEN 

Important features: 
Signal peptide: 

amino acids 1-28 

Transmembrane domain: 

amino acids 140-163 

N-glycosylation sites. 

amino acids 71-74, 80-83, 89-92, 204-207, 423-426 



AppJD=10063553 



Page 228 



FIGURE 77 

GAGC3AGCGGGCCGAGGACTCCAGCGTGCCCAGGTCTGGCATCCTGCACTTGCTGCCCTCTGACAC 
CTGGGAAGATGGCCGGCCCGTGGACCTTCACCCTTCTCTGTGGTTTGCTGGCAGCCACCTTGATC 
CAAGCCACCCTCAGTCCCACTGCAGTTCTCATCCTCGGCCCAAAAGTCATCAAAGAAAAGCTGAC 
ACAGGAGCTGAAGGACCACAACGCCACCAGCATCCTGCAGCAGCTGCCGCTGCTCAGTGCCATGC 
GGGi^AAAGCCAGCCGGAGGCATCCCTGTGCTGGGCAGCCTGGTGAACACCGTCCTGAAGCACATC 
ATCTGGCTGAAGGTCATCACAGCTAACATCCTCCAGCTGCAGGTGAAGCCCTCGGCCAATGACCA 
GGAGCTGCTAGTCAAGATCCCCCTGGACATGGTGGCTGGATTCAACACGCCCCTGGTCAAGACCA 
TCGTGGAGTTCCACATGACGACTGAGGCCCAAGCCACCATCCGCATGGACACCAGTGCAAGTGGC 
CCCACCCGCCTGGTCCTCAGTGACTGTGCCACCAGCCATGGGAGCCTGCGCATCCAACTGCTGTA 
TAAGCTCTCCTTCCTGGTGAACGCCTTAGCTAAGCAGGTCATGAACCTCCTAGTGCCATCCCTGC 
CCAATCTAGTGAAAAACCAGCTGTGTCCCGTGATCGAGGCTTCCTTCAATGGCATGTATGCAGAC 
CTCCTGCAGCTGGTGAAGGTGCCCATTTCCCTCAGCATTGACCGTCTGGAGTTTGACCTTCTGTA 
TCCTGCCATCAAGGGTGACACCATTCAGCTCTACCTGGGGGCCAAGTTGTTGGACTCACAGGGAA 
AGGTGACCAAGTGGTTCAATAACTCTGCAGCTTCCCTGACAATGCCCACCCTGGACAACATCCCG 
TTCAGCCTCATCGTGAGTCAGGACGTGGTGAAAGCTGCAGTGGCTGCTGTGCTCTCTCCAGAAGA 
ATTCATGGTCCTGTTGGACTCTGTGCTTCCTGAGAGTGCCCATCGGCTGAAGTCAAGCATCGGGC 
T GAT CAATGAAAAGGC TGCAGATAAGCTGGGATCTACCCAGAT CGTGAAGATCCTAACTCAGGAC 
ACTCCCGAGTTTTTTATAGACCAAGGCCATGCCAAGGTGGCCCAACTGATCGTGCTGGAAGTGTT 
TCCCTCCAGTGAAGCCCTCCGCCCTTTGTTCACCCTGGGCATCGAAGCCAGCTCGGAAGCTCAGT 
TT T ACACCAAAGGTGACCAACT TATACTCAACT TGAATAACATCAGCTCTGATCGGATCCAGCTG 
ATGAACTCTGGGATTGGCTGGTTCCAACCTGATGTTCTGAAAAACATCATCACTGAGATCATCCA 
CTCCATCCTGCTGCCGAACCAGAATGGCAAATTAAGATCTGGGGTCCCAGTGTCATTGGTGAAGG 
CCTTGGGATTCGAGGCAGCTGAGTCCTCACTGACCAAGGATGCCCTTGTGCTTACTCCAGCCTCC 
TTGTGGAAACCCAGCTCTCCTGTCTCCCAGTGAAGACTTGGATGGCAGCCATCAGGGAAGGCTGG 
GTCCCAGCTGGGAGTATGGGTGTGAGCTCTATAGACCATCCCTCTCTGCAATCAATAAACACTTG 

CCTGTGAAAAA 
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FIGURE 78 

MAGPWTFTLLCGLLAATLIQATLSPTAVLILGPKVIKEKLTQELKDHNATSILQQLPLLSAMREK 
PAGG I PVLGSLVNTVLKHI I WLKVI TANI LQLQWPSANDQELLVKI PLDMVAGFNT PLVKT I VE 
FHMTTEAQAT IRMDTSASGPTRLVLSDCATSHGSLRIQLLYKLS FLVNALAKQVMNLLVPSLPNL 
VKNQLCPVI EAS FNGMYADLLQLVKVP I SLS I DRLE FDLLYPAI KGDT IQLYLGAKLLDSQGKVT 
KWFMNSAASLTMPTLDNIPFSLIVSQDWKAAVAAVLSPEEFMVLLDSVLPESAHRLKSSIGLIN 
EKAADKLGSTQIVKILTQDTPEFFIDQGHAKVAQLIVLEVFPSSEALRPLFTLGIEASSEAQFYT 
KGDQL I LNLNN I S S DRIQLMNSG I GWFQPDVLKNI I TE I IHS I LLPNQNGKLRSGVPVSLVKALG 
FEAAE S S LT KDALVLT PAS LWKP SS PVSQ 

Important features of the protein: 
Signal peptide: 

amino acids 1-21 

N-glycosylation sites . 

amino acids 48-51, 264-267, 401-404 

Glycosaminoglycan attachment site. 

amino acids 412-415 

LBP / BPI / CETP family proteins. 

amino acids 4 07-457 



App_ID=10063553 



Page 230 



FIGURE 7Q 



m 



GAGAGAAG T CAGCC T GGCAGAGAGACTCTGAAATGAGGGATTAGAGGTGT T CAAGGAGCAAGAGC 
T TCAGCCTGAAGACAAGGGAGCAGTCCC T GAAGACGCTT C T AC TGAGAGG TC TGCCATGGCCTCT 
CTTGGCCTCCAACTTGTGGGCTACATCCTAGGCCTTCTGGGGCTTTTGGGCACACTGGTTGCCAT 
GCTGCTCCCCAGCTGGAAAACAAGTTCTTATGTCGGTGCCAGCATTGTGACAGCAGTTGGCTTCT 
CCAAGGGCCTCTGGATGGAATGTGCCACACACAGCACAGGCATCACCCAGTGTGACATCTATAGC 
ACCCTTCTGGGCCTGCCCGCTGACATCCAGGCTGCCCAGGCCATGATGGTGACATCCAGTGCAAT 
CTCCTCCCTGGCCTGCATTATCTCTGTGGTGGGCATGAGATGCACAGTCTTCTGCCAGGAATCCC 
13 GAGCCAAAGACAGAGTGGCGGTAGCAGGTGGAGTCTTTTTCATCCTTGGAGGCCTCCTGGGATTC 

ATTCCTGTTGCCTGGAATCTTCATGGGATCCTACGGGACTTCTACTCACCACTGGTGCCTGACAG 
CATGAAATTTGAGATTGGAGAGGCTCTTTACTTGGGCATTATTTCTTCCCTGTTCTCCCTGATAG 
CTGGAATCATCCTCTGCTTTTCCTGCTCATCCCAGAGAAATCGCTCCAACTACTACGATGCCTAC 
HI CAAGCCCAACCTCTTGCCACAAGGAGCTCTCCAAGGCCTGGTCAACCTCCCAAAGTCAAGAGTGA 
*W GTTCAATTCCTACAGCCTGACAGGGTATGTG TGAA GAACCAGGGGCCAGAGCTGGGGGGTGGCTG 
JL GGTCT GT GAAAAACAGTGGACAGCACCCCGAGGGCCACAGGTGAGGGACACTACCACTGGATCGT 

|S GT C AGAAGGTGC TGC TGAGGAT AGACTGAC T T T GGCCAT TGGAT T GAGCAAAGGCAGAAATGGGG 

P! GCTAGTGTAACAGCATGCAGGTTGAATTGCCAAGGATGCTCGCCATGCCAGCCTTTCTGTTTTCC 
III TCA.CCTTGCTGCTCCCCTGCCCTAAGTCCCCAACCCTCAACTTGAAACCCCATTCCCTTAAGCCA 

GGACTCAGAGGATCCCTTTGCCCTCTGGTTTACCTGGGACTCCATCCCCAAACCCACTAATCACA 
TCCCACTGACTGACCCTCTGTGATCAAAGACCCTCTCTCTGGCTGAGGTTGGCTCTTAGCTCATT 
GCTGGGGATGGGAAGGAGAAGCAGTGGCTTTTGTGGGCATTGCTCTAACCTACTTCTCAAGCTTC 
CCTCCAAAGAAACTGATTGGCCCTGGAACCTCCATCCCACTCTTGTTATGACTCCACAGTGTCCA 
GACTAATTT GTGCATGAACT GAAATAAAACCATCCTACGGTAT CCAGGGAACAGAAAGCAGGATG 
CAGGATGGGAGGACAGGAAGGCAGCCTGGGACAT T TAAAAAAATA 
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FIGURE 80 



MASLGLQLVGYI LGLLGLLGTLVAMLLPSWKTSS YVGAS I VTAVGFSKGLWMECATHSTGI TQCD 
I YSTLLGLPADIQAAQAMMVTSSAI SSLACI ISWGMRCTVFCQESRAKDRVAVAGGVFFILGGL 
LGFIPVAWNLHGILRDFYSPLVPDSMKFEIGEALYLGIISSLFSLIAGIILCFSCSSQRNRSNYY 
DAYQAQPLATRS S PRPGQPPKVKSE FNS YSLTGYV 

Important features of the protein: 
Signal peptide: 

amino acids 1-24 

Transmembrane domains: 

amino acids 82-102, 117-140, 163-182 

N-glycosylation site. 

amino acids 190-193 

PMP-22 / EMP / MP20 family proteins. 

amino acids 46-59 
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FIGURE 81 



CCCACGCGTCCGCGCCTCTCCCTTCTGCTGGACCTTCCTTCGTCTCTCCATCTCTCCCTCCTTTC 
CCCGCGTTCTCTTTCCACCTTTCTCTTCTTCCCACCTTAGACCTCCCTTCCTGCCCTCCTTTCCT 
GCCCACCGCTGCTTCCTGGCCCTTCTCCGACCCCGCTCTAGCAGCAGACCTCCTGGGGTCTGTGG 
GTTGATCTGTGGCCCCTGTGCCTCCGTGTCCTTTTCGTCTCCCTTCCTCCCGACTCCGCTCCGGG 
ACCAGCGGCCTGACCCTGGGGAAAGGATGGTTCCCGAGGTGAGGGTCCTCTCCTCCTTGCTGGGA 

CTCGCGCTGCTCTGGTTCCCCCTGGACTCCCACGCTCGAGCCCGCCCAGACATGTTCTGCCTTTT 

CCATGGGAAGAGATACTCCCCCGGCGAGAGCTGGCACCCCTACTTGGAGCCACAAGGCCTGATGT 

ACTGCCTGCGCTGTACCTGCTCAGAGGGCGCCCATGTGAGTTGTTACCGCCTCCACTGTCCGCCT 

GTCCACTGCCCCCAGCCTGTGACGGAGCCACAGCAATGCTGTCCCAAGTGTGTGGAACCTCACAC 

TCCCTCTGGACTCCGGGCCCCACCAAAGTCCTGCCAGCACAACGGGACCATGTACCAACACGGAG 

AGATCTTCAGTGCCCATGAGCTGTTCCCCTCCCGCCTGCCCAACCAGTGTGTCCTCTGCAGCTGC 

ACAGAGGGCCAGATCTACTGCGGCCTCACAACCTGCCCCGAACCAGGCTGCCCAGCACCCCTCCC 

ACT GCCAGACT CCT GC TGCCAAGCCT GCAAAGATGAGGCAAGT GAGCAAT CGGATGAAGAGGACA 

GTGTGCAGTCGCTCCATGGGGTGAGACATCCTCAGGATCCATGTTCCAGTGATGCTGGGAGAAAG 

AGAGGCCCGGGCACCCCAGCCCCCACTGGCCTCAGCGCCCCTCTGAGCTTCATCCCTCGCCACTT 

CAGACCCAAGGGAGCAGGCAGCACAACTGT CAAGAT CG T CCT GAAGGAGAAACATAAGAAAGCCT 

GTGTGCATGGCGGGAAGACGTACTCCCACGGGGAGGTGTGGCACCCGGCCTTCCGTGCCTTCGGC 

CCCTTGCCCTGCATCCTATGCACCTGTGAGGATGGCCGCCAGGACTGCCAGCGTGTGACCTGTCC 

CACCGAGTACCCCTGCCGT CAC CCCGAGAAAGTGGCT GGGAAGTGCT GCAAGAT T T GCCCAGAGG 

AC^AAGCAGACCCTGGCCACAGTGAGATCAGTTCTACCAGGTGTCCCAAGGCACCGGGCCGGGTC 

CTCGTCCACACATCGGTATCCCCAAGCCCAGACAACCTGCGTCGCTTTGCCCTGGAACACGAGGC 

CTCGGACTTGGTGGAGATCTACCTCTGGAAGCTGGTAAAAGATGAGGAAACTGAGGCTCAGAGAG 

G TCAAG TACCTGGCCCAAGGCCACACAGCCAGAATCT T CCACTTGACTCAGATCAAGAAAGTCAG 

G^GCAAGACTTCCAGAAAGAGGCACAGCACTTCCGACTGCTCGCTGGCCCCCACGAAGGTCACT 

GGAAC GT CT TCCTAGCCCAGAC CCTGGAGC TGAAGGT CACGGCCAGTC CAGACAAAG T GACCAAG 

ACATAACAAAGACCTAACAGTT GCAGATATGAGCTGTAT AATTGT TGT TATTATATATTAATAAA 

TAAGAAGTTGCATTACCCTCAAAAAAAAAAAAAAAAAAAAAA 



FIGURE 82 



MVPEVRVLSSLLGIJU^LWFPLDSHARARPDMFCLFHGKRYSPGESWHPYLEPQGLMYCLRCTCSE 
GAHVSCYRLHCPPVHCPQPVTEPQQCCPKCVEPHTPSGLRAPPKSCQHNGTMYQHGEIFSAHELF 
PSRLPNQCVLCSCTEGQIYCGLTTCPEPGCPAPLPLPDSCCQACKDEASEQSDEEDSVQSLHGVR 
HPQDPCSS DAGRKRGPGT PAPT GLSAPLS FIPRH FRPKGAGS T TVK I VLKEKHKKACVHGGKTYS 
HGEVWHPAFRAFGPLPCILCTCEDGRQDCQRVTCPTEYPCRHPEKVAGKCCKICPEDKADPGHSE 
ISSTRCPKAPGRVLVHTSVSPSPDNLRRFALEHEASDLVEIYLWKLVKDEETEAQRGEVPGPRPH 
SQNLPLDSDQESQEARLPERGTALPTARWPPRRSLERLPSPDPGAEGHGQSRQSDQDITKT 

Signal peptide: 

amino acids 1-25 
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FIGURE 8s 



GACAGCTGTGTCTCGATGGAGTAGACTCTCAGAACAGCGCAGTTTGCCCTCCGCTCACGCAGAGCCTCTCC 
GTGCjCTTCCGCACCTTGAGCATTAGGCCAGTTCTCCTCTTCTCTCTAATCCATCCGTCACCTCTCCTGTCA 
TCCGT TTCCAT GCC GTGAGGTCCATTCACAGAAC AC ATCC ATG GCTCT C ATGCT CAGTTTGGTT CTGAGTC 
TCCTCAAGCTGGGATCAGGGCAGTGGCAGGTGTTTGGGCCAGACAAGCCTGTCCAGGCCTTGGTGGGGGAG 
GACGCAGCATTCTCCTGTTTCCTGTCTCCTAAGACCAATGCAGAGGCCATGGAAGTGCGGTTCTTCAGGGG 
CC AGTTCTCT AGCGTGGTCC AC CT CT AC AGGGACGGGAAGGACC AGCC ATTT AT GC AGAT GC CAC AGT ATC 
AAGC;CAGGACAAAACTGGTGAAGGATTCTATTGCGGAGGGGCGCATCTCTCTGAGGCTGGAAAACATTACT 
GTGTTGGATGCTGGCCTCTATGGGTGCAGGATTAGTTCCCAGTCTTACTACCAGAAGGCCATCTGGGAGCT 
AC AGGTGT CAGCACT GGGCT CAGT TCCTCT CATT TCCATCAC GGGATATGTT GAT AG AGACAT C C AGCT AC 
| TCTGTC AGTC CT CGGGCTGGTT CC CCCGGCCCACAGCGAAGT GGAAAGGT CC ACAAGG ACAGGATT TGTCC 

. j ACAGACTCCAGGACAAACAGAGACATGCATGGCCTGTTTGATGTGGAGATCTCTCTGACCGTCCAAGAGAA 
111 CGCC GGGAGC AT AT CCTGT T CC AT GCGGCATGCTC ATCTGAGCC GAGAGGTGGAATCC AGGGT AC AGAT AG 

10 GAGiVTACCTTTTTCGAGCCTATATCGTGGCACCTGGCTACCAAAGTACTGGGAATACTCTGCTGTGGCCTA 

= . * 

W- TTT T TTGGCATT GT TGGACT GAAGAT TTTCTT CTCC AAATTC CAGTGG AAAAT CC AGGCGGAACTGGACT G 



saw? 
f 55 1 



a 



f ar 5 



FU 



GAGi^AGAAAGCACGGACAGGCAGAATTGAGAGACGCCCGGAAACACGCAGTGGAGGTGACTCTGGATCCAG 
AGACGGCTCACCCGAAGCTCTGCGTTTCTGATCTGAAAACTGTAACCCATAGAAAAGCTCCCCAGGAGGTG 
CCTC ACTCT GAGAAG AG ATT TACAAGGAAG AGTGTGGT GGCT TCTC AG AGTT T CCAAGCAGGGAAAC ATT A 
CT GGGAGGTGGACGGAGG ACAC AAT AAAAGGT GGCGCGTGGGAGTGTGCCGGGATG ATGT GGAC AG GAGG A 
AGGAGT ACGT GACTT T GTCT CCCGATCATGGGTACTGGGTCC TCAGACTG AAT GGAGAAC AT TTGT ATTT C 
ACAT TAAATCCCCGTTTT AT CAGCGTCTTCCCCAGGACCCCACCT ACAAAAATAGGGGTCTT CCTGGACT A 
TGAGTGTGGGACCATCTCCTTCTTCAACATAAATGACCAGTCCCTTATTTATACCCTGACATGTCGGTTTG 
AAGGCTTAT T GAGGCCCT AC AT TGAGTAT CCGT C CT AT AATGAGC AAAAT GGAACT CCC ATAGT C ATCTGC 
CCAGTC ACCCAGGAATC AGAGAAAGAGGCCTCT T GGCAAAGGGC CTCTGC AAT C CC AG AG ACAAGC AAC AG 
TGAGTCCTCCTC ACAGGC AACCACGC CCT TCCT C CC CAGGGGTGAAATG TAGG ATG AATC AC ATCCC AC AT 
T CT T CT TT AGGGAT ATTAAGG T CT CTCT CCCAGATC C AAAGT CCCG CAGC AGCCGGCC AAGGTGGCTT CCA 
GATGAAGGGGGACTGGCCT GT C CAC ATGGGAGTC AGGTGTCATGGC TGCCCT GAGC TGGG AGGGAAGAAGG 
CTGACATT ACAT T T AGTTT GCT CT C ACT CCAT CT GGCT AAGT GAT C TTGAAAT ACC ACCTCT CAGGTG AAG 
AACC GTC AGGAATTC C CATCT C AC AGGCTGT GGT GT AG ATTAAGT AGACAAGGAAT GTGAAT AAT GCT TAG 
AT CT T ATTGATGACAG AGTGT ATCCT AAT GGT TT GTTC ATTAT AT T ACACTT TC AGT AAAAAAA 
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FIGURE 84 



m 
m 

VST \ 



m 



?= a 



malmlslvlsllklgs gqwqvfgpdkpvqalvgedaa.fsc flspktnaeamevrffrgq fs swh 
lyrdgkdqpfmqmpqyqgrtklvkdsiaegrislrlenitvldaglygcrissqsyyqkaiwelq 
vsa:lgsvplisitgyvdrdiqllcqssgwfprptakwkgpqgqdlstdsrtnrdmhglfdveisl 

TVQENAGSISCSMRHAHLSREVESRVQIGDTFFEPISWHLATKVLGILCCGLFFGIVGLKIFFSK 
FQWKIQAELDWRRKHGQAELRDARKHAVEVTLDPETAHPKLCVSDLKTVTHRKAPQEVPHSEKRF 
TRKS WAS QS FQAGKHYWEVDGGHNKRWRVGVCRDDVDRRKEYVTLS PDHGYWVLRLNGEHLYFT 
LNPRFISVFPRTPPTKIGVFLDYECGTISFFNINDQSLIYTLTCRFEGLLRPYIEYPSYNEQNGT 
PIVICPVTQESEKEASWQRASAIPETSNSESSSQATTPFLPRGEM 

Signal peptide: 

amino acids 1-17 

Transmembrane domain: 



f s 3 amino acids 239-255 
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FIGURE 85 



AACAGACGTTCCC'l'CGCGGC CC TGGCACC TCTAACCCCAGACATGC TGCTGCT GCTGCTGCCC CT 
GCT CT GGGGGAGGGAGAGGGCGGAAGGACAGACAAGTAAACT GCTGACGAT GCAGAGT TCCGTGA 
CGGTGCAGGAAGGCCTGTGTGTCCATGTGCCCTGCTCCTTCTCCTACCCCTCGCATGGCTGGATT 
TACCCTGGCCCAGTAGTTCATGGCTACTGGTTCCGGGAAGGGGCCAATACAGACCAGGATGCTCC 
AGTGGCCACAAACAACCCAGCTCGGGCAGTGTGGGAGGAGACTCGGGACCGATTCCACCTCCTTG 
GGGACCCACATACCAAGAATTGCACCCTGAGCATCAGAGAT GCCAGAAGAAGTGAT GCGGGGAGA 
TACT TCT T TCGTATGGAGAAAGGAAG TAT AAAATGGAAT TATAAACATCACCGGC TC T CT GTGAA 



Q TGTGACAGCCTTGACCCACAGGCCCAACATCCTCATCCCAGGCACCCTGGAGTCCGGCTGCCCCC 
0 AGAJ^TCTGACCTGCTCTGTGCCCTGGGCCTGTGAGCAGGGGACACCCCCTATGATCTCCTGGATA 
ill GGGACCTCCGTGTCCCCCCTGGACCCCTCCACCACCCGCTCCTCGGTGCTCACCCTCATCCCACA 

GCCCCAGGACCATGGCACCAGCCTCACCTGTCAGGTGACCTTCCCTGGGGCCAGCGTGACCACGA 
ACA^GACCGTCCATCTCAACGTGTCCTACCCGCCTCAGAACTTGACCATGACTGTCTTCCAAGGA 
GACGGCACAGTATCCACAGTCTTGGGAAATGGCTCATCTCTGTCACTCCCAGAGGGCCAGTCTCT 
GCGCCTGGTCTGTGCAGTTGATGCAGTTGACAGCAATCCCCCTGCCAGGCTGAGCCTGAGCTGGA 
GAGGCCTGACCCTGTGCCCCTCACAGCCCTCAAACCCGGGGGTGCTGGAGCTGCCTTGGGTGCAC 



m CTGAGGGATGCAGCTGAATTCACCTGCAGAGCTCAGAACCCTCTCGGCTCTCAGCAGGTCTACCT 
flf GAACGTCTCCCTGCAGAGCAAAGCCACATCAGGAGTGACTCAGGGGGTGGTCGGGGGAGCTGGAG 
0 CCACAGCCCTGGTCTTCCTGTCCTTCTGCGTCATCTTCGTTGTAGTGAGGTCCTGCAGGAAGAAA 
fl1 TCGGCAAGGCCAGCAGCGGGCGTGGGAGATACGGGCATAGAGGATGCAAACGCTGTCAGGGGTTC 

AGCCTCTCAGGGGCCCCTGACTGAACCTTGGGCAGAAGACAGTCCCCCAGACCAGCCTCCCCCAG 
CTTCTGCCCGCTCCTCAGTGGGGGAAGGAGAGCTCCAGTATGCATCCCTCAGCTTCCAGATGGTG 
AAGCCTTGGGACTCGCGGGGACAGGAGGCCACTGACACCGAGTACTCGGAGATCAAGATCCACAG 
ATGAG AAACT GCAGAGAC TCACCCTGAT TGAGGGAT CACAGCC CCT CCAGGCAAGGGAGAAGTCA 
GAGGC TGATTCT TGTAGAATTAAC AGCCCTCAACG T GATGAGC TATGATAACACTATGAATTATG 
TGCAGAGTGAAAAGCACACAGGCTTTAGAGTCAAAGTATCTCAAACCTGAATCCACACTGTGCCC 
TCCCTTTTATTTTTTTAACTAAAAGACAGACAAATTCCTA 
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FIGURE 86 



MLLLLLPLLWGRERAEGQTSKLLTMQS SVTVQEGLCVHVPCS FS YP SHGW I YPGPWHGYW FREG 
ANTDQDAPVATNNPARAVWEETRDRFHLLGDPHTKNCTLS IRDARRSDAGRYFFRMEKGS IKWNY 
KHHRLSVNVTALTHRPNILIPGTLESGCPQNLTCSVPWACEQGTPPMISWIGTSVSPLDPSTTRS 
SVLTLIPQPQDHGTSLTCQVTFPGASVTTNKTVHLNVSYPPQNLTMTVFQGDGTVSTVLGNGSSL 
SLPEGQSLRLVCAVDAVDSNPPARLSLSWRGLTLCPSQPSNPGVLELPWVHLRDAAEFTCRAQNP 
LGSQQVYLNVS LQSKATS GVTQGWGGAGATALVFLS FCVI FVWRSCRKKSARPAAGVGDTGIE 
DANAVRGSASQGPLTEPWAEDSPPDQPPPASARSSVGEGELQYASLSFQMVKPWDSRGQEATDTE 



YSEIKIHR 




Signal peptide: 



amino acids 1-15 



Transmembrane domain: 




amino acids 351-370 
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FIGURE 87 



AGA^GCTGCACTCTGTTGAGCTCCMGGCGCAGTGGAGGGAGGGAGTGAAGGAGCTCTCTGTAC 
CCAPlGGAAAGTGCAGCTGAGACTCAGACAAGATTACAATGAACCAACTCAGCTTCCTGCTGTTTC 
T CATAGCGACCACCAGAGGATGGAG TACAGATGAGGC TAAT ACT T AC T T CAAGGAAT GGACCT GT 
TCTTCGTCTCCATCTCTGCCCAGAAGCTGCAAGGAAATCAAAGACGAATGTCCTAGTGCATTTGA 
TGGCCTGTATTTTCTCCGCACTGAGAATGGTGTTATCTACCAGACCTTCTGTGACATGACCTCTG 
GGGGTGGCGGCTGGACCCTGGTGGCCAGCGTGCATGAGAATGACATGCGTGGGAAGTGCACGGTG 
GGCGATCGCTGGTCCAGTCAGCAGGGCAGCAAAGCAGACTACCCAGAGGGGGACGGCAACTGGGC 
C AAC T AC AACAC C T T T GGAT C T G C AGAGGCGGC CAC GAGC GAT GAC T ACAAG AAC C C T GGC T AC T 
JJJ ACGACAT CCAGGCCAAGGACCT GGGCATCTGGCACGT GCCCAATAAGTCCCCCATGCAGCACTGG 

71 AGAAACAGCTCCCTGCTGAGGTACCGCACGGACACTGGCTTCCTCCAGACACTGGGACATAATCT 

W 

|fl GTT TGGCAT CTACCAGAAATAT CCAGTGAAATAT GGAGAAGGAAAGTGTT GGACT GACAACGGCC 

i|| CGGTGATCCCTGTGGTCTATGATTTTGGCGACGCCCAGAAAACAGCATCTTATTACTCACCCTAT 

W GGCCAGCGGGAATTCACTGCGGGATTTGTTCAGTTCAGGGTATTTAATAACGAGAGAGCAGCCAA 

CGCCTTGTGTGCTGGAATGAGGGTCACCGGATGTAACACTGAGCATCACTGCATTGGTGGAGGAG 

5 s 

iti GATACTTTCCAGAGGCCAGTCCCCAGCAGTGTGGAGATTTTTCTGGTTTTGATTGGAGTGGATAT 

III 

O GGAACTCAT GT TGGT TACAGCAGCAGCCGTGAGATAACTGAGGCAGCTGTGC TTCTATTCTATCG 

f|| T TGAG AGTTTTGTGGGAGGGAACCCAGACCTCTCCTCCCAACCATGAGATCCCAAGGATGGAGAA 

Ut CAACT T ACCCAG T AGC TAGAAT GTTAATGGCAGAAGAGAAAACAATAAAT CATAT T GACTCAAGA 
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FIGURE 88 



MNQLSFLLFLIATTRGWSTDEANTYFKEWTCSSSPSLPRSCKEIKDECPSAFDGLYFLRTENGVI 
YQT FCDMT SGGGGWTLVASVHENDMRGKCTVGDRWS SQQGSKM)YPEGDGNWAISrYNT FGSAEAAT 
SDDYKNPGYYDIQAKDLGIWHVPNKSPMQHWRNSSLLRYRTDTGFLQTLGHNLFGIYQKYPVKYG 
EGKCWTDNGPVI PWYDFGDAQKTAS YYS P YGQREFTAGFVQFRVFNNERAANALCAGMRVTGCN 
TEHHC I GGGGYFPEAS PQQCGDFS G FDWS GYGTHVG YS S SRE I TEAAVLLFYR 

Important features: 
Signal peptide: 

amino acids 1-16 

N-glycosylation site. 

amino acids 163-167 

Glycosaminoglycan attachment sites. 

amino acids 74-78, 289-293 

N-myristoylation sites. 

amino acids 76-82, 115-121, 124-130, 253-259, 292-298 
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FIGURE 80 



CTAGATTTGTCGGCTTGCGGGGAGACTTCAGGAGTCGCTGTCTCTGAACTTCCAGCCTCAGAGAC 
CGCCGCCCTTGTCCCCGAGGGCCMGGGCCGGGTCTCAGGGCTTGTGCCCTCTCGCTTCCTGACG 
CTCCTGGCGCATCTGGTGGTCGTCATCACCTTATTCTGGTCCCGGGACAGCAACATACAGGCCTG 
CCTGGCTCTCACGTTCACCCCCGAGGAGTATGACAAGCAGGACATTCAGCTGGTGGCCGCGCTCT 
CTGTCACCCTGGGCCTCTTTGCAGTGGAGCTGGCCGGTTTCCTCTCAGGAGTCTCCATGTTCAAC 
AGCACCCAGAGCCTCATCTCCATTGGGGCTCACTGTAGTGCATCCGTGGCCCTGTCCTTCTTCAT 
ATTCGAGCGTTGGGAGTGCACTACGTATTGGTACATTTTTGTCTTCTGCAGTGCCCTTCCAGCTG 
T CACTGAAATGGCT T TAT TCGT CACC GTC T T TGGGCTGAAAAAGAAACCC T TCTGAT TACCT TCA 
TGACGGGAACCTAAGGACGAAGCCTACAGGGGCAAGGGCCGCTTCGTATTCCTGGAAGAAGGAAG 
GCATAGGCTTCGGTTTTCCCCTCGGAAACTGCTTCTGCTGGAGGATATGTGTTGGAATAATTACG 
TCTTGAGTCTGGGATTATCCGCATTGTATTTAGTGCTTTGTAATAAAATATGTTTTGTAGTAACA 
T TAAGAC T T ATATACAGT T T TAGGGGACAAT TAAAAAAAAAAAA 
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FIGURE QQ 

MGRVSGLVPSRFLTLLAHLWVI T LFWSRDSNIQACLPLTFTPEEYDKQDIQLVAALSVTLGLFA 
VELAGFLSGVSMFNS TQS L I S I GAHCSASVALS FFI FEEWECT TYWY I FVFCSALPAVTEMALFV 
TVFGLKKKPF 

Transmembrane domain: 

amino acids 12-28 (type II), 51-66, 107-124 
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FIGURE Qi 

CTGGGACCCCGAAAAGAGAAGGGGAGAGCGAGGGGACGAGAGCGGAGGAGGAAGATGCAACTGAC 
TCGCTGCTGCTTCGTGTTCCTGGTGCAGGGTAGCCTCTATCTGGTCATCTGTGGCCAGGATGATG 
GTCCTCCCGGCTCAGAGGACCCTGAGCGTGATGACCACGAGGGCCAGCCCCGGCCCCGGGTGCCT 
CGGAAGCGGGGCCACATCTCACCTAAGTCCCGCCCCATGGCCAATTCCACTCTCCTAGGGCTGCT 
GGCCCCGCCTGGGGAGGCTTGGGGCATTCTTGGGCAGCCCCCCAACCGCCCGAA.CCACAGCCCCC 
CACCCTCAGCCAAGGTGAAGAAAATCTTTGGCTGGGGCGACTTCTACTCCAACATCAAGACGGTG 
|„i GCCCTGAACCTGCTCGTCACAGGGAAGATTGTGGACCATGGCAATGGGACCTTCAGCGTCCACTT 
C3 CCAAXTACAATGCCACAGGCCAGGGAAACATCTCCATCAGCCTCGTGCCCCCCAGTAAAGCTGTM 
D AGT TCCACCAGGAACAGCAGAT CTTCATCGAAGCCAAGGCCTCCAAAATCTTCAACTGCCGGATG 

?l GAGTGGGAGAAGGTAGAACGGGGCCGCCGGACCTCGCTTTGCACCCACGACCCAGCCAAGATCTG 

w 

|R CTCCCGAGACCACGCTCAGAGCTCAGCCACCTGGAGCTGCTCCCAGCCCTTCAAAGTCGTCTGTG 

|f| T CT ACAT CGCC T TCTACAGCACGGACTATCGGCTGG T CCAGAAGGT GT GC CCAGAT TACAAC TAC 

W CATAGTGATACCCCCTACTACCCATCTGGGTGACCCGGGGCAGGCCACAGAGGCCAGGCCAGGGC 

l_ TGGAAGGACAGGCCTGCCCATGCAGGAGACCATCTGGACACCGGGCAGGGAAGGGGTTGGGCCTC 

r'J AGGCAGGGAGGGGGGTGGAGACGAGGAGATGCCAAGTGGGGCCAGGGCCAAGTCTCAAGTGGCAG 

yi 

AGAAAGGGT CCCAAGT GC TGGT CCCAACC T GAAGC TGTGGAGT GACTAGATCACAGGAGCACTGG 
f|J AGGAGGAGTGGGCTCTCTGTGCAGCCTCACAGGGCTTTGCCACGGAGCCACAGAGAGATGCTGGG 
CI TCCCCGAGGCCTGTGGGCAGGCCGATCAGTGTGGCCCCAGATCAAGTCATGGGAGGAAGCTAAGC 

CCTT GGT TCTTGCCAT CC TGAGGAAAGATAGCAACAGGGAGGGGGAGAT T T CAT CAGT G T GGACA 
GCCTGTCAACTTAGGATGGATGGCTGAGAGGGCTTCCTAGGAGCCAGTCAGCAGGGTGGGGTGGG 
GCCAGAGGAGCTCTCCAGCCCTGCCTAGTGGGCGCCCTGAGCCCCTTGTCGTGTGCTGAGCATGG 
CAT GAGGCT GAAGTGGCAACCCT GGGGTC T T TGATGT C T T GACAGAT TGACCATCTGT C TCCAGC 
CAGGCCACCCCTTTCCAAAATTCCCTCTTCTGCCAGTACTCCCCCTGTACCACCCATTGCTGATG 
GCACACC CATCC T TAAGCTAAGACAGGACGAT TGTGGTCCT CCCACAC T AAGGC CACAGCCCATC 
CGCGTGCTGTGTGTCCCTCTTCCACCCCAACCCCTGCTGGCTCCTCTGGGAGCATCCATGTCCCG 
GAGAGGGGTCCCTCAACAGTCAGCCTCACCTGTCAGACCGGGGTTCTCCCGGATCTGGATGGCGC 
CGCCCTCTCAGCAGCGGGCACGGGTGGGGCGGGGCCGGGCCGCAGAGCATGTGCTGGATCTGTTC 
TGTGTGTCTGTCTGTGGGTGGGGGGAGGGGAGGGAAGTCTTGTGAAACCGCTGATTGCTGACTTT 
TGTGTGAAGAATCGTGTTCTTGGAGCAGGAAATAAAGCTTGCCCCGGGGCA 
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FIGURE Q2 



mqltrccfvflvqgslylvi cgqddgppgsedperddhegqprprvprkrghi spksrpmans tl 
lgl:lappgeawgilgqppnrpnhspppsakvkkifgwgdfysniktvalnllvtgkivdhgngtf 
svhfqhnatgqgn isis lvppskave fhqeqqi fi eakaski fncrmewekvergrrts lcthdp 
akicsrdhaqssatwscsqpfkwcvyiafystdyrlvqkvcpdynyhsdtpyypsg 

Important features of* the protein: 
Signal peptide: 
amino acids 1-14 

N-glycosylation sites. 

amino acids 62-65, 127-130, 137-140, 143-146 

2-oxo acid dehydrogenases acyl transferase 

amino acids 61-71 
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FIGURE Q3 

CGGTGGCCATG^CTGCGGCCGTGTTCTTCGGCTGCGCCTTCATTGCCTTCGGGCCTGCGCTCGCC 
CTTTATGTCTTCACCATCGCCATCGAGCCGTTGCGTATCATCTTCCTCATCGCCGGAGCTTTCTT 
CTGGTTGGTGTCTCTACTGATTTCGTCCCTTGTTTGGTTCATGGCAAGAGTCATTATTGACAACA 
AAGATGGACCAACACAGAAATATCTGCTGATCTTTGGAGCGTTTGTCTCTGTCTATATCCAAGAA 

ATGT T CCGAT T T GCATATTATAAACT CT TAAAAAAAGCCAGTGAAGGT T TGAAGAGTATAAACCC 
AGGTGAGACAGCACCCTCTATGCGACTGCTGGCCTATGTTTCTGGCTTGGGCTTTGGAATCATGA 
GTGGAGTATTTTCCTTTGTGAATACCCTATCTGACTCCTTGGGGCCAGGCACAGTGGGCATTCAT 
GGAGATT C T CC TCAAT TC TT CCTTTAT TCAGCT T TCATGACGCTGG T CAT TATC TT GC TGCATGT 
ATTCTGGGGCATTGTATTTTTTGATGGCTGTGAGAAGAAAAAGTGGGGCATCCTCCTTATCGTTC 
TCCTGACCCACCTGCTGGTGTCAGCCCAGACCTTCATAAGTTCTTATTATGGAATAAACCTGGCG 
TCAGCATTTATAATCCTGGTGCTCATGGGCACCTGGGCATTCTTAGCTGCGGGAGGCAGCTGCCG 
AAGCCTGAAACTCTGCCTGCTCTGCCAAGACAAGAACTTTCTTCTTTACAACCAGCGCTCCAGAT 
AACCT CAGGGAACCAGCACTTCCCAAACCGCAGAC TACAT CTT TAGAGGAAGCACAACTGT GCCT 
T TT TCTGAAAATCCCTTT TTCT GGTGGAATTGAGAAAGAAATAAAACTATGCAGATA 
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FIGURE 

MTAAVFFGCAFIAFGPALALYVFT IAIEPLRI IFLIAGAFFWLVSLLISSLWFMARVI IDNKDG 
PTQKYLL I FGAFVSVYIQEMFRFAYYKLLKKASEGLKS INPGETAPSMRLLAYVSGLGFGIMSGV 
FSF^NTLSDSLGPGTVGIHGDSPQFFLYSAFMTLVIILLHVFWGIVFFDGCEKKKWGILLIVLLT 
HLLVSAQTFI SS YYGINLASAFI ILVLMGTWAFLAAGGSCRSLKLCLLCQDKNFLLYNQRSR 



Important features of the protein: 
Signal peptide: 



n 

s-va amino acids 1-19 

Ui Transmembrane domains: 

W amino acids 32-51, 119-138, 152-169, 216-235 

m 

3.3 

2 Glycosaminoglycan attachment site. 

□ amino acids 120-123 

V; ! I 

Sodium: neurotransmitter symporter family protein 

fll 
rsi 

s w 



amino acids 31-65 
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FIGURE Q5 

AAT T T T T CACCAGAGTAAACT T GAGAAAC CAAC TGGACCT T GAG TAT TGTACAT TTTGCCTCGTG 
GACCCAAAGGTAGCAATCTGAAACATGAGGAGTACGATT CTACTGTT TTGTCT T CTAGGATCAAC 
TCGGTCATTACCACAGCTCAAACCTGCTTTGGGACTCCCTCCCACAAAACTGGCTCCGGATCAGG 
GAACACT AC CAAACCAACAGCAGTCAAAT CAGGTC T TTCCT T C T T TAAGTC TGATACCAT TAACA 
CAGATGCTCACACTGGGGCCAGATCTGCATCTGTTAAATCCTGCTGCAGGAATGACACCTGGTAC 
CCAGACCCACCCAT T GACCCTGGGAGGGT TGAA.TG TACAACAGCAACTGCACCCACAT GT GT TAC 
CAAT T T T TGT CACACAAC T TGGAGCCCAGGGCACTAT CCTAAGCTCAGAGGAAT TGCCAC AAATC 
TTCACGAGCCTCATCATCCATTCCTTGTTCCCGGGAGGCATCCTGCCCACCAGTCAGGCAGGGGC 
TAATCCAGATGTCCAGGATGGAAGCCTTCCAGCAGGAGGAGCAGGTGTAAATCCTGCCACCCAGG 
GAACCCCAGCAGGCCGCCTCCCAACTCCCAGTGGCACAGATGACGACTTTGCAGTGACCACCCCT 
GCAGGCATCCAAAGGAGCACACATGCCATCGAGGAAGCCACCACAGAATCAGCAAATGGAATTCA 
G TA^G CTGTTTCAAATTTTTTCAACTAAGCTGCCTCGAATTTGGTGATACATGTGAATCTTTATC 
ATTGATTATAT TATGGAATAGATTGAGACACATTGGATAGTCTTAGAAGAAAT TAAT TCTTAATT 
TAC CT GAAAATAT TCT TGAAAT T T CAGAAAATATGT T CTATGTAGAGAAT CCCAACT TT TAAAAA 
CAAT AAT T CAAT GGAT AAAT CTGTCTTT G AAAT AT AAC AT TAT GC T GC C T GGAT GAT AT G CAT AT 
TAAftACATAT T T GGAAAAC T GGAAAAAAAAAAAAAA?^A7\AA7\AAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 



App_ID= 10063 553 



Page 247 



FIGURE Q6 

MRSTILLFCLLGSTRSLPQLKPALGLPPTKLAPDQGTLPNQQQSNQVFPSLSLIPLTQM 
LTLGPDLHLLNPAAGMTPGTQTHPLTLGGLNVQQQLHPHVLPIFVTQLGAQGTILSSEE 
LPQI FTS L 1 1 HSLFPGGI LPTS QAGANPDVQDGSLPAGGAGVNPATQGTPAGRLPT PS G 
T DD D FAVT T PAG I QRS T HAI EEAT TE SANG I Q 



Signal peptide: 

amino acids 1-16 



s •; :: 

3 

5 .. a 



111 
fl 

rs - 
3 B % 
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FIGURE Q7 

GCTCAAGTGCCCTGCCTTGCCCCACCCAGCCCAGCCTGGCCAGAGCCCCCTGGAGAAGGAGCTCT 
CTTCTTGCTTGGCAGCTGGACCAAGGGAGCCAGTCTTGGGCGCTGGAGGGCCTGTCCTGACCATG 
GTCCCTGCCTGGCTGTGGCTGCTTTGTGTCTCCGTCCCCCAGGCTCTCCCCAAGGCCCAGCCTGC 
AGAGCTGTCTGTGGAAGTTCCAGAAAACTATGGTGGAAATTTCCCTTTATACCTGACCAAGTTGC 
CGCTGCCCCGTGAGGGGGCTGAAGGCCAGATCGTGCTGTCAGGGGACTCAGGCAAGGCAACTGAG 
GGCCCATTTGCTATGGATCCAGATTCTGGCTTCCTGCTGGTGACCAGGGCCCTGGACCGAGAGGA 

GCAGGCAGAGTACCAGC TACAGG T CACCC T GGAGATGCAGGAT GGACATG TC TT G T GGGGTCC AC 
AGCCTGTGCTTGTGCACGTGAAGGATGAGAATGACCAGGTGCCCCATTTCTCTCAAGCCATCTAC 
AGAGCTCGGCTGAGCCGGGGTACCAGGCCTGGCATCCCCTTCCTCTTCCTTGAGGCTTCAGACCG 
GGA.TGAGCCAGGCACAGCCAACTCGGATCTTCGATTCCACATCCTGAGCCAGGCTCCAGCCCAGC 
CTTCCCCAGACATGTTCCAGCTGGAGCCTCGGCTGGGGGCTCTGGCCCTCAGCCCCAAGGGGAGC 
ACCAGCCTTGACCACGCCCTGGAGAGGACCTACCAGCTGTTGGTACAGGTCAAGGACATGGGTGA 
CCA.GGCCTCAGGCCACCAGGCCACTGCCACCGTGGAAGTCTCCATCATAGAGAGCACCTGGGTGT 
CCCTAGAGCCTATCCACCTGGCAGAGAATCTCAAAGTCCTATACCCGCACCACATGGCCCAGGTA 

CACT GGAGT GGGGG T GAT G T GCACTATCACC TGGAGAGCCATCCCCCGGGACCCT T TGAAG T GAA 
T GCAGAGGGAAACC TC TACGTGACCAGAGAGCTGGACAGAGAAGCCCAGGC TGAGTAC CTGCTCC 
AGGTGCGGGCTCAGAATTCCCATGGCGAGGACTATGCGGCCCCTCTGGAGCTGCACGTGCTGGTG 
ATGGATGAGAATGACAACGTGCCTATCTGCCCTCCCCGTGACCCGACAGTCAGCATCCCTGAGCT 
CAGTCCACCAGGTACTGAAGTGACTAGACTGTCAGCAGAGGATGCAGATGCCCCCGGCTCCCCCA 
ATTCCCACGTTGTGTATCAGCTCCTGAGCCCTGAGCCTGAGGATGGGGTAGAGGGGAGAGCCTTC 

C AGiG T G G AC C C CAC T T CAG GCAG T G T GAC GC TG GGGG T G C T C C CAC T C CGAG CAGG CC AGAACAT 
CCTGCTTCTGGTGCTGGCCATGGACCTGGCAGGCGCAGAGGGTGGCTTCAGCAGCACGTGTGAAG 
TCGAAGTCGCAGTCACAGATATCAATGATCACGCCCCTGAGTTCATCACTTCCCAGATTGGGCCT 
AT^GCCTCCCTGAGGATGTGGAGCCCGGGACTCTGGTGGCCATGCTAACAGCCATTGATGCTGA 
CCTCGAGCCCGCCTTCCGCCTCATGGATTTTGCCATTGAGAGGGGAGACACAGAAGGGACTTTTG 
GCCTGGATTGGGAGCCAGACTCTGGGCATGTTAGACTCAGACTCTGCAAGAACCTCAGTTATGAG 
GmGCTCCAAGTCATGAGGTGGTGGTGGTGGTGCAGAGTGTGGCGAAGCTGGTGGGGCCAGGCCC 
AGGCCCTGGAGCCACCGCCACGGTGACTGTGCTAGTGGAGAGAGTGATGCCACCCCCCAAGTTGG 
ACCAGGAGAGCTACGAGGCCAGTGTCCCCATCAGTGCCCCAGCCGGCTCTTTCCTGCTGACCATC 
CAGCCCTCCGACCCCATCAGCCGAACCCTCAGGTTCTCCCTAGTCAATGACTCAGAGGGCTGGCT 
CTGCATTGAGAAATTCTCCGGGGAGGTGCACACCGCCCAGTCCCTGCAGGGCGCCCAGCCTGGGG 
AaiCCTACACGGTGCTTGTGGAGGCCCAGGATACAGCCCTGACTCTTGCCCCTGTGCCCTCCCAA 
TACCTCTGCACACCCCGCCAAGACCATGGCTTGATCGTGAGTGGACCCAGCAAGGACCCCGATCT 
GGCCAGTGGGCACGGTCCCTACAGCTTCACCCTTGGTCCCAACCCCACGGTGCAACGGGATTGGC 
GCCTCCAGACTCTCAATGGTTCCCATGCCTACCTCACCTTGGCCCTGCATTGGGTGGAGCCACGT 
GMCACATAATCCCCGTGGTGGTCAGCCACAATGCCCAGATGTGGCAGCTCCTGGTTCGAGTGAT 
CGTGTGTCGCTGCAACGTGGAGGGGCAGTGCATGCGCAAGGTGGGCCGCATGAAGGGCATGCCCA 
CGAAGCTGTCGGCAGTGGGCATCCTTGTAGGCACCCTGGTAGCAATAGGAATCTTCCTCATCCTC 
AT T T T CACC CAC T GGACCAT GTCAAGGAAGAAGGACC CGGATCAACCAGCAGACAGCGT GCCCCT 
GAi\GGCGACTGTCTGAATGGCCCAGGCAGCTCTAGCTGGGAGCTTGGCCTCTGGCTCCATCTGAG 
TCC CC TGGGAGAGAGC CCAGCACCCAAGAT CCAGCAGGGGACAGGACAGAGTAGAAGCCCCT CCA 
TCTGCCCTGGGGTGGAGGCACCATCACCATCACCAGGCATGTCTGCAGAGCCTGGACACCAACTT 
TAT GGACTGCCCATGGGAGT GCTCCAAATGTCAGGGT GT TT GCCCAATAATAAAGCCCCAGAGAA 
CTGGGCTGGGCCCTATGGGAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAG 



FIGURE q8 

MVPAWLWLLCVSVPQALPKAQPAELSVEVPENYGGNFPLYLTKLPLPREGAEGQIVLSGDSGKAT 
EGPFAMDPDSGFLLVTRALDREEQAEYQLQVTLEMQDGHVLWGPQPVLVHVKDENDQVPHFSQAI 
YRARLSRGTRPGIPFLFLEASDRDEPGTANSDLRFHILSQAPAQPSPDMFQLEPRLGALALSPKG 
STSLDHALERTYQLLVQVKDMGDQASGHQATATVEVSIIESTWSLEPIHLAENLKVLYPHHMAQ 
VHWSGGDVHYHLESHPPGPFEVNAEGNLYVTRELDREAQAEYLLQVRAQMSHGEDYAAPLELHVL 
VMDENDNVPICPPRDPTVSIPELSPPGTEVTRLSAEDADMGSPNSHWYQLLSPEPEDGVEGRA 
FQVDPTSGSVTLGVLPLRAGQNILLLVLMDLAGAEGGFSSTCEVEVAVTDINDHAPEFITSQIG 
PISLPEDVEPGTLVAMLTAIDADLEPAFRLMDFAIERGDTEGTFGLDWEPDSGHVRLRLCKNLSY 
EAAPSHE WWVQSVAKLVGPGPGPGATATVTVLVERVMPPPKLDQE S YEAS VP I SAPAGS FLLT 
IQPSDPISRTLRFSLVNDSEGWLCIEKFSGEVHTAQSLQGAQPGDTYTVLVEAQDTALTLAPVPS 
QYLCTPRQDHGLIVSGPSKDPDLASGHGPYSFTLGPNPTVQRDWRLQTLNGSHAYLTLALHWVEP 
REH 1 1 PWVSHNAQMWQL LVRVI VCRCNVEGQCMRKVGRMKGMPTKLSAVG I LVGTLVAI GI FL I 
LI FTHWTMSRKKDPDQPADSVPLKATV 

Signal peptide : 

amino acids 1-18 

Transmembrane domain: 

amino acids 762-784 
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FIGURE QQ 



GGCTGACCGTGCTACAT TGCCTGGAGGAAGCCTAAGGAACCCAGGCATCCAGCT GCCCACGCCTG 
AGT CC AAGATTC T TCCCAGGAACACAAACGTAGGAGACC CACGCTCC T GGAAGCACCAGCCT TTA 
TCTCTTCACCTTCAAGTCCCCTTTCTCAAGAATCCTCTGTTCTTTGCCCTCTAAAGTCTTGGTAC 
ATCTAGGACCCAGGCAT C T TGCT T T CCAGCCACAAAGAGACAGATGAAGAT GCAGAAAGGAAATG 
T TCTC CT TAT GT T T GG TC TACT AT T GCAT T TAGAAGCTGCAACAAAT TCCAATGAGACTAGCACC 
TCTGCCAACACTGGATCCAGTGTGATCTCCAGTGGAGCCAGCACAGCCACCAACTCTGGGTCCAG 
TGTGACCTCCAGTGGGGTCAGCACAGCCACCATCTCAGGGTCCAGCGTGACCTCCAATGGGGTCA 
GCATAGTCACCAAC TCTGAGT T CCATACAACC TCCAGTGGGAT CAGCACAGCCACCAACTCTGAG 
TTCAGCACAGCGTCCAGTGGGATCAGCATAGCCACCAACTCTGAGTCCAGCACAACCTCCAGTGG 
!f GGCCAGCACAGCCACCAACTCTGAGTCCAGCACACCCTCCAGTGGGGCCAGCACAGTCACCAACT 
C3 CTGGGTCCAGTGTGACCTCCAGTGGAGCCAGCACTGCCACCAACTCTGAGTCCAGCACAGTGTCC 
O AGT AGGGCCAGCAC T GCCACCAACTC T GAGTCTAGCACACT CT CCAGTGGGGCCAGCACAGCCAC 

CAACTCTGACTCCAGCACAACCTCCAGTGGGGCTAGCACAGCCACCAACTCTGAGTCCAGCACAA 
CCTCCAGTGGGGCCAGCACAGCCACCAACTCTGAGTCCAGCACAGTGTCCAGTAGGGCCAGCACT 
GCCACCAACTCTGAGTCCAGCACAACCTCCAGTGGGGCCAGCACAGCCACCAACTCTGAGTCCAG 
AACGACCTCCAATGGGGCTGGCACAGCCACCAACTCTGAGTCCAGCACGACCTCCAGTGGGGCCA 
GCA.CAGCCACCAACTCTGACTCCAGCACAGTGTCCAGTGGGGCCAGCACTGCCACCAACTCTGAG 
TCCAGCACGACCTCCAGTGGGGCCAGCACAGCCACCAACTCTGAGTCCAGCACGACCTCCAGTGG 
GGC TAGCACAGCCACCAACT CT GAC T CCAGCACAACCTCCAGT GGGGCCGGCACAGCCACCAACT 
C TGAGTC CAGCACAGTGTCC AGTGGGATCAGCACAGT CACCAAT TCTGAGT CCAGCACACCCT CC 
W AGTGGGGCCAACACAGCCACCAACTCTGAGTCCAGTACGACCTCCAGTGGGGCCAACACAGCCAC 
111 CAACTCTGAGTCCAGCACAGTGTCCAGTGGGGCCAGCACTGCCACCAACTCTGAGTCCAGCACAA 

CCTCCA-GTGGGGTCAGCAC^GCCACCAACTCTGAGTCCAGCACAACCTCCAGTGGGGCTAGCACA 
GCCACCAACTCTGACTCCAGCACAACCTCCAGTGAGGCCAGCACAGCCACCAACTCTGAGTCTAG 
;l CACAGTGTCCAGTGGGATCAGCACAGTCACCAAT TCTGAGT CCAGCACAACCTCCAGTGGGGCCA 

J- ACACAGCCACCAACTCTGGGTCCAGTGTGACCTCTGCAGGCTCTGGAACAGCAGCTCTGACTGGA 
II ATGCACACAACTTCCCATAGTGCATCTACTGCAGTGAGTGAGGCAAAGCCTGGTGGGTCCCTGGT 

GCCGTGGGAAATCTTCCTCATCACCCTGGTCTCGGTTGTGGCGGCCGTGGGGCTCTTTGCTGGGC 
TCTTCTTCTGTGTGAGAAACAGCCTGTCCCTGAGAAACACCTTTAACACAGCTGTCTACCACCCT 
CATGGCCTCAACCATGGCCTTGGTCCAGGCCCTGGAGGGAATCATGGAGCCCCCCACAGGCCCAG 
G TGGAG T CC TAAC TGG T TCT GGAGGAGACCAGTATCAT CGATAGCCATGGAGAT GAGCGGGAGGA 
ACPlGCGGGCCCTGAGCAGCCCCGGAAGCAAGTGCCGCATTCTTCAGGAAGGAAGAGACCTGGGCA 
CCCAAGACCTGGTTTCCTTTCATTCATCCCAGGAGACCCCTCCCAGCTTTGTTTGAGATCCTGAA 
AATCTTGAAGAAGGTATTCCTCACCTTTCTTGCCTTTACCAGACACTGGAAAGAGAATACTATAT 
TGCTCATT TAGC TAAGAAAT AAATACATCTCATCTAACACACACGACAAAGAGAAGCT GT GCT TG 
CCCCGGGGTGGGTATCTAGCTCTGAGATGAACTCAGTTATAGGAGAAAACCTCCATGCTGGACTC 
CATC T GGCATTCAAAATCT CCACAGTAAAATCCAAAGACC T CAAAAAAAAAAAAAAAAAAAAAAA 
AA2AAAAAAAAAAAAAAAAAAAAAAAAAAAA 



m 
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FIGURE 100 

MKMQKGWLLMFGLLLHLEAATNSNETSTSANTGSSVISSGASTATNSGSSVTSSGVSTATISGS 
SVTSNGVS IVTNSEFHTTSSGISTATNSEFSTASSGISIATNSESSTTSSGASTATNSESSTPSS 
GASTVTNSGSSVTSSGASTATNSESSTVSSRASTATNSESSTLSSGASTATNSDSSTTSSGASTA 
TNSESSTTSSGASTATNSESSTVSSRASTATNSESSTTSSGASTATNSESRTTSNGAGTATNSES 
STTSSGASTATNSDSSTVSSGASTATNSESSTTSSGASTATNSESSTTSSGASTATNSDSSTTSS 
GAGTATNSESSTVSSGISTVTNSESSTPSSGANTATNSESSTTSSGANTATNSESSTVSSGASTA 
TNSESSTTSSGVSTATNSESSTTSSGASTATNSDSSTTSSEASTATNSESSTVSSGISTVTNSES 
STTSSGANTATNSGSSVTSAGSGTAALTGMHTTSHSASTAVSEAKPGGSLVPWEIFLITLVSWA 
AVGLFAGLFFCVRNSL S LRNT FNT AVYHPHGLNHGLGPGPGGNHGAPHRPRWSPNWFWRRPVS S I 
AMEMSGRNSGP 

Sigrnal peptide: 

amino acids 1-20 

Transmembrane domain: 

amino acids 510-532 
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FIGURE 101 

GGCCGGACGCCTCCGCGTTACGGGATGAATTAACGGCGGGTTCCGCACGGAGGTTGTGACCCCTA 
CGGAGCCCCAGCTTGCCCACGCACCCCACTCGGCGTCGCGCGGCGTGCCCTGCTTGTCACAGGTG 
GGAGGCT GGAAC TATCAGGCTGAAAAACAGAGTGGGTAC TCTC T TC TGGGAAGCTGGCAACAAAT 
GGATGATGTGATATATGCATTCCAGGGGAAGGGAAATTGTGGTGCTTCTGAACCCATGGTCAATT 
AACiSAGGCAGTTTCTAGCTACTGCACGTACTTCATAAAGCAGGACTCTAAAAGCTTTGGAATCAT 
GGTGTCATGGAAAGGGATTTACTTTATACTGACTCTGTTTTGGGGAAGCTTTTTTGGAAGCATTT 
TCATGCTGAGTCCCTTTTTACCTTTGATGTTTGTAAACCCATCTTGGTATCGCTGGATCAACAAC 
CGCCTTGTGGCAACATGGCTCACCCTACCTGTGGCATTATTGGAGACCATGTTTGGTGTAAAAGT 
GAT TATAAC TGGGGATGCAT T T GT TCCTGGAGAAAGAAGT GTCAT TAT CATGAACCAT CGGACAA 
GAATGGACTGGATGTTCCTGTGGAATTGCCTGATGCGATATAGCTACCTCAGATTGGAGAAAATT 
|f| TGCCTCAAAGCGAGTCTCAAAGGTGTTCCTGGATTTGGTTGGGCCATGCAGGCTGCTGCCTATAT 
111 CTTCATTCATAGGAAATGGAAGGATGACAAGAGCCATTTCGAAGACATGATTGATTACTTTTGTG 
W ATA.T T CACGAACCAC T TCAACT CCTCATAT TCC CAGAAGGGACTGATCTCACAGAAAACAGCAAG 

JL s TC T CGAAGTAAT GCAT T TGCTGAAAAAAAT GGAC TTCAGAAATATGAATATG TT T T ACATCCAAG 

JjJ AACTACAGGCTTTACTTTTGTGGTAGACCGTCTAAGAGAAGGTAAGAACCTTGATGCTGTCCATG 
pi ATA.TCACTGTGGCGTATCCTCACAACATTCCTCAATCAGAGAAGCACCTCCTCCAAGGAGACTTT 

CCCAGGGAAATCCACTTTCACGTCCACCGGTATCCAATAGACACCCTCCCCACATCCAAGGAGGA 
CCTTCAACTCTGGTGCCACAAACGGTGGGAAGAGAAAGAAGAGAGGCTGCGTTCCTTCTATCAAG 
GGG AGAAGAATT TTTATT TTACCGGACAGAGTGTCAT TCCACCT TGCAAGTCTGAACT CAGGGTC 
CTTGTGGTCAAATTGCTCTCTATACTGTATTGGACCCTGTTCAGCCCTGCAATGTGCCTACTCAT 
ATATTTGTACAGTCTTGTTAAGTGGTATTTTATAATCACCATTGTAATCTTTGTGCTGCAAGAGA 
GAATATT TGGTGGACTGGAGAT CATAGAACTTGCATGT TACCGACTT TTACACAAACAGCCACAT 
T T.SAAT TCAAAGAAAAATGAGTAAGAT TATAAGGTT T GCCATGTGAAAACCTAGAGCATAT T T TG 
GA2 AT G T TC TAAACC T T TCT AAGCT CAGAT GCAT T T T T GCATGAC TAT GTCGAATATT TCTTAC T 
GCCAT CAT TAT T TGT TAAAGATAT T T T GCAC TTAAT T TT GT GGGAAAAATAT TGC T ACAATT TT T 
TTTAATCTCTGAATGTAATTTCGATACTGTGTACATAGCAGGGAGTGATCGGGGTGAAATAACTT 
GGGCCAGAATAT TAT TAAAC AATC ATCAGGC TT TTAAA 



•P! 

cn 
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m 
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FIGURE 102 



MHSRGREIWLLNPWSINEAVSSYCTYFIKQDSKSFGIMVSWKGIYFILTLFWGSFFGSIFMLSP 
FLP:UMFWPSWYRWINNRLVATWLTLPVALLETMFGVKVI ITGDAFVPGERSVI IMNHRTRMDI#I 
FLWNCI11RYSYLRLEKICLKASLKGVPGFGWAMQAAAYIFIHRKWKDDKSHFEDMIDYFCDIHEP 
LQLLIFPEGTDLTENSKSRSNAFAEKNGLQKYEYVLHPRTTGFTFWDRLREGKNLDAVHDITVA 
YPHNIPQSEKHLLQGDFPREIHFHVHRYPIDTLPTSKEDLQLWCHKRWEEKEERLRSFYQGEKNF 
YFTGQSVIPPCKSELRVLWKLLSILYWTLFSPAMCLLIYLYSLVKWYFIITIVIFVLQERIFGG 
LEIIELACYRLLHKQPHLNSKKNE 

Important features of the protein: 
Signal peptide: 

amino acids 1-22 

Transmembrane domains: 

amino acids 44-63, 90-108, 354-377 
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FIGURE 103 



CGGCTCGMCGGCTCGAGTGAAGAGCCTCTCCACGGCTCCTGCGCCTGAGACAGCTGGCCTGACC 
TCCAAATCATCCATCCACCCCTGCTGTCATCTGTTTTCATAGTGTGAGATCAA.CCCACAGGAATA 
TCCATGGCTTTTGTGCTCATTTTGGTTCTCAGTTTCTACGAGCTGGTGTCAGGACAGTGGCAAGT 
CACTGGACCGGGCAAGTTTGTCCAGGCCTTGGTGGGGGAGGACGCCGTGTTCTCCTGCTCCCTCT 
TTCCTGAGACCAGTGCAGAGGCTATGGAAGTGCGGTTCTTCAGGAATCAGTTCCATGCTGTGGTC 
CACCTCTACAGAGATGGGGAAGACTGGGAATCTAAGCAGATGCCACAGTATCGAGGGAGAACTGA 
GTTTGTGAAGGACTCCATTGCAGGGGGGCGTGTCTCTCTAAGGCTAAAAAACATCACTCCCTCGG 
ACATCGGCCTGTATGGGTGCTGGTTCAGTTCCCAGATTTACGATGAGGAGGCCACCTGGGAGCTG 
CGGGTGGCAGCACTGGGCTCACTTCCTCTCATTTCCATCGTGGGATATGTTGACGGAGGTATCCA 
GTTACTCTGCCTGTCCTCAGGCTGGTTCCCCCAGCCCACAGCCAAGTGGAAAGGTCCACAAGGAC 
AGGATTTGTCTTCAGACTCCAGAGCAAATGCAGATGGGTACAGCCTGTATGATGTGGAGATCTCC 
ATT AT AGT CCAGGAAAATGCTGGGAGCATAT T GT GT T CCAT CCACC T TGC TGAGCAGAG T CATGA 
GGTGGAATCCAAGGTATTGATAGGAGAGACGTTTTTCCAGCCCTCACCTTGGCGCCTGGCTTCTA 
fl TTTTACTCGGGTTACTCTGTGGTGCCCTGTGTGGTGTTGTCATGGGGATGATAATTGTTTTCTTC 
lU AAATCCAAAGGGAAAATCCAGGCGGAACTGGAC T GGAGAAGAAAGCACGGACAGGCAGAAT TGAG 

Iff AGACGCCCGGAAACACGCAGTGGAGGTGACTCTGGATCCAGAGACGGCTCACCCGAAGCTCTGCG 

TTTCTGATCTGAAAACTGTAACCCATAGAAAAGCTCCCCAGGAGGTGCCTCACTCTGAGAAGAGA 
TTTACAAGGAAGAGTGTGGTGGCTTCTCAGGGTTTCCAAGCAGGGAGACATTACTGGGAGGTGGA 
CGT GGGACAAAATGTAGGG T GGTAT GT GGGAGTGTG T CGGGAT GACGTAGACAGGGGGAAGAACA 
AT G T GAC T T TGT C TCCCAACAATGGGTAT TGGGT CCT CAGACT GACAACAGAACAT TT GT AT T T C 
ACA.TTCAATCCCCATTTTATCAGCCTCCCCCCCAGCACCCCTCCTACACGAGTAGGGGTCTTCCT 
GGA.CTATGAGGGTGGGACCATCTCCTTCTTCAATACAAATGACCAGTCCCTTATTTATACCCTGC 
TGA.CATGTCAGTT TGAAGGCTT GTTGAGACCCTATATCCAGCATGCGATGTATGACGAGGAAAAG 
GGGACTCCCATAT TCATATGTCCAGT GTCCTGGGGATGAGACAGAGAAGACCCTGCT TAAAGGGC 
!U CCCACAC CACAGACCCAGACACAGCCAAGGGAGAGT GCTCCCGACAGG TGGCCCCAGCT T CCTCT 

J3 CCGGAGCCTGCGCACAGAGAGTCACGCCCCCCACTCTCCTTTAGGGAGCTGAGGTTCTTCTGCCC 
ri| TGAGCCCTGCAGCAGCGGCAGTCACAGCTTCCAGATGAGGGGGGATTGGCCTGACCCTGTGGGAG 

TCAGAAGCCAT GGC TGCCCT GAAGT GGGGAC GGAATAGACT CACATTAGGT T TAGT TT GTGAAAA 
CTCCATCCAGCTAAGCGATCTTGAACAAGTCACAACCTCCCAGGCTCCTCATTTGCTAGTCACGG 
ACAGT GAT T CC TGCCT CACAGGT GAAGATTAAAGAGACAAC GAATGT GAAT CAT GC TT GCAGG TT 
TGAGGGCACAGTGTTTGCTAATGATGTGTTTTTATATTATACATTTTCCCACCATAAACTCTGTT 
TGCTTATTCCACATTAATTTACTTTTCTCTATACCAAATCACCCATGGAATAGTTATTGAACACC 
T GCT T TGTGAGGCTCAAAGAATAAAGAGGAGGTAGGAT T T T TCAC TGAT TCTAT AAGCCCAGCAT 
TACCTGATACCAAAACCAGGCAAAGAAAACAGAAGAAGAGGAAGGAAAACTACAGGTCCATATCC 
C T CAT TAAC ACAGACAC AAAAAT T C T AAAT AAAAT T T T AAC AAAT T AAAC TAAACAAT AT AT T T A 
AAGrAT GATATATAACTACT C AGTGTGGT T TG TCCCACAAAT GCAGAGT TGGT T T AATAT T TAAAT 
ATCAACCAGTGTAATTCAGCACATTAATAAAGTAAAAAAGAAAACCATAAAAAAAAAAAAAAA 
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FIGURE 104 

MAFVX I LVLS F YEL VSGQWQVT GPGKFVQAL VGE DAVFS CSL FPET S AEAMEVRFFRNQ FHAWH 
LYRDGEDWESKQHPQYRGRTEFVKDSIAGGRVSLRLKNITPSDIGLYGCWFSSQIYDEEATWELR 
Viy\LGSLPLISIVGYVDGGIQLLCLSSGWFPQPTAKWKGPQGQDLSSDSRANADGYSLYDVEISI 
IVQENAGS ILCSIHLAEQSHEVESKVLIGETFFQPSPWRLASILLGLLCGALCGVVMGMI IVFFK 
SKGKIQAELDWRRKHGQAELRDARKHAVEVTLDPETAHPKLCVSDLKTVTHRKAPQEVPHSEKRF 
TRKS WAS QG FQAGRHYWE VDVGQNVGWYVGVCRDDVDRGKNNVTL S PNNG YWLRLT TEHL YFT 
FNPHFISLPPSTPPTRVGVFLDYEGGTISFFNTNDQSLIYTLLTCQFEGLLRPYIQHAMYDEEKG 
TPIFICPVSWG 

Signal peptide: 

amino acids 1-17 

Transmembrane domains: 

amino acids 131-150, 235-259 
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FIGURE log 



CCTT C AC AGGACTCT T CAT T GCT GGT TGGC AATGATGT AT CGGCC AGATGT GGT GAGGGCTAGGAAAAGAG 
TTTGTTGGGAACCCTGGGTTATCGGCCTCGTCATCTTCATATCCCTGATTGTCCTGGCAGTGTGCATTGGA 
CTCACTGTTCATTATGTGAGATATAATCAAAAGAAGACCTACAATTACTATAGCACATTGTCATTTACAAC 
TGACAAACTAT ATGCT GAGT T T GGCAGAGAGGCT T CTAACAATT TT AC AG AAAT GAGCCAGAGACTT G AAT 
CAAT GGTGAAAAATGC ATTT T ATAAATCT CCATT AAGGGAAGAAT TTGTC AAGT CT CAGGTT AT CAAGTTC 
AGTC AAC AGAAGCAT GGAGT GT T GGCTCAT AT GCTGTTGATT TGT AGATT T C ACTCT ACTGAGGATCCTG A 
1^. AACT GT AG AT AAAATTGT TCAACT TGTTTT AC AT GAAAAGCT GC AAGATGCT GT AGGACCCCCT AAAGTAG 

§*\ ATCCTC ACTC AGT T AAAAT T AAAAAAATCAAC AAGACAGAAACAGACAGCT ATCT AAACC AT T GCT GCGG A 

Q AC AC GAAGAAGT AAAACTCT AGGT C AGAGTCT CAGG ATCGTT GGT GGG AC AGAAGT AG AAGAGGGTGAATG 

y! GCCCTGGCAGGCTAGCCTGCAGTGGGATGGGAGTCATCGCTGTGGAGCAACCTTAATTAATGCCACATGGC 
W TT GT GAGTGC TGCTC ACTGTTT T ACAAC AT AT AAGAACCCT GCCAGAT GG ACTG CTTCCTTT GGAGT AACA 

U I. AT A AAACCT T CGAAAATG AAACGGGGTCTCCGGAGAAT AATT GTCC ATGAAAAAT ACAAACACCCATC ACA 

f I TGACTATGATATTTCTCTTGCAGAGCTTTCTAGCCCTGTTCCCTACACAAATGCAGTACATAGAGTTTGTC 
W TCC C TG ATGC AT CCT ATG AGTT TC AACC AGGTGATGTG ATGT TT GTGAC AGGAT T T GG AGCACT GAAAAAT 

GAT GGT T ACAGT CAAAATCATC TT CGACAAGCACAGGTGACT CT C ATAGACGCT AC AACTTGCAAT GAACC 
in TCA AGCT T ACAATGAC GCCAT AACTCCT AG AATGTT ATGT GC TGGCTCCTT AGAAGGAAAAACAGATGC AT 

*4 GCC AGGGT GACT CTGG AGGACC AC TGGT T AGTTCAG ATGCTAGAGATATCTGGTACCTTG CT GGAAT AGTG 

fll AGCT GGGG AGAT GAATGT G C GAAACCCAAC AAGC CTGGTGTT T AT ACT AGAGTT ACGGCCT T GCGGGACTG 

C| GAT T ACTTCAAAAACT GGT ATCTAAGAG AC AAAAGC CTCATGGAAC AG AT AACAT TTT TTTTTGTT TTTTG 

f|| GGTGTGGAGGCCATTTTTAGAGATACAGAATTGGAGAAGACTTGCAAAACAGCTAGATTTGACTGATCTCA 

ATAAACTGTTTGCTTGATGCATGTATTTTCTTCCCAGCTCTGTTCCGCACGTAAGCATCCTGCTTCTGCCA 
GAT C AACTCT GT CAT C TGT GAGCAAT AGTTGAAACT TT AT GT ACAT AG AG AAAT AGAT AATACAAT ATT AC 
AT T AC AGCCT GT AT T C ATTT GT T C TCT AGAAGTT TTGTCAGAAT TTTGACT T GT T G AC AT AAAT TTGT AAT 
GCATATATACAATTTGAAGCACTCCTTTTCTTCAGTTCCTCAGCTCCTCTCATTTCAGCAAATATCCATTT 
TC5AGGTGCAGAACAAGGAGTGAAAGAAAATATAAGAAGAAAAAAATCCCCTACATTTTATTGGCACAGAA 
AAGrT ATTAGGT GTT TTTCT T AGTGGAAT AT T AGAAATG ATCAT AT TC ATT AT GAAAGGTCAAGC AAAGAC A 
GC AGAATACC AAT C ACTT CAT C AT TT AG GAAGT AT GGGAACT AAGTT AAGGAAGTCC AGAAAGAAGCC AAG 
ATATATCCTTATTTTCATTTCCAAACAACTACTATGATAAATGTGAAGAAGATTCTGTTTTTTTGTGACCT 
AT JsAT AATT AT ACAAACTT CAT GCAATGT ACT TGTTCT AAGC AAATTAAAGC AAAT AT TT ATT T AAC ATT G 
TTACTGAGGATGTCAACATATAACAATAAAATATAAATCACCCA 
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FIGURE 106 



MMYRPDWRARKRVCWE PWVI GLVI FI SL I VLAVC I GLTVHYVRYNQKKT YNYYS TLS FTTDKL Y 
AEFGREASNNFTEMSQRLESMVKNAFYKSPLREEFVKSQVIKFSQQK^ 

PETVDKIVQLVLHEKLQDAVGPPKVDPHSVKIKKINKTETDSYLNHCCGTRRSKTLGQSLRIVGG 
TEVEEGEWPWQASLQWDGSHRCGATLINATWLVSAAHCFTTYKNPARWTASFGVTIKPSKiyiKRGL 
RRIIVHEPCYKHPSHDYDISLAELSSPVPYTNAVHRVCLPDASYEFQPGDVMFVTGFGALKNDGYS 
QNHLRQAQVT L I DAT T CNE PQAYNDAI T PRMLCAGS LEGKT DACQGDS GG PLVS SDARD I WYLAG 




Transmembrane domain: 



amino acids 21-40 (type II) 



IVSWGDECAKPNKPGVYTRVTALRDWITSKTGI 
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FIGURE 107 



AGAGAAAGAAGCGTCTCCAGCTGAAGCCAATGCAGCCCTCCGGCTCTCCGCGAAGAAGTTCCCTG 
CCCCGATGAGCCCCCGCCGTGCGTCCCCGACTATCCCCAGGCGGGCGTGGGGCACCGGGCCCAGC 
GCCiSACGATCGCTGCCGTTTTGCCCTTGGGAGTAGGATGTGGTGAAAGGATGGGGCTTCTCCCTT 
ACGGGGCTCACAATGGCCAGAGAAGATTCCGTGAAGTGTCTGCGCTGCCTGCTCTACGCCCTCAA 
TCTGCTCTTTTGGTTAATGTCCATCAGTGTGTTGGCAGTTTCTGCTTGGATGAGGGACTACCTAA 
ATAAF GT TCTCACT T TAACT GC AGAAACGAGGG TAGAGGAAGCAGTCATTT TGAC TTACT T TCC T 
GTGGTTCATCCGGTCATGATTGCTGTTTGCTGTTTCCTTATCATTGTGGGGATGTTAGGATATTG 
TGGAACGGTGAAAAGAAATCTGTTGCTTCTTGCATGGTACTTTGGAAGTTTGCTTGTCATTTTCT 
GTGTAGAACTGGCT TGTGGCGT T TGGACATAT GAACAGGAACT TATGG T TCCAGTACAAT GGTCA 
GAT AT GGTCACTTTGAAAGCCAGGATGACAAATTATGGAT TACCTAGATATCGGTGGCT TACTCA 
TGC T T GGAAT T T TT TTCAGAGAGAG T T TAAGTGCTG T GGAGTAGTATAT T TCAC TGAC TGGT TGG 
AAAT GACAGAGATGGACTGGCCCCCAGAT TCC T GC TGTGT T AGAGAAT TCCCAGGAT GT TCCAAA 
CAGGCCCACCAGGAAGATCTCAGTGACCTTTATCAAGAGGGTTGTGGGAAGAAAATGTATTCCTT 
T TT GAGAGGAACCAAACAACT GCAGGTGCTGAGGT T T CT GGGAATC T C CAT T GGGGTGACACAA& 
TCCTGGCCATGATTCTCACCATTACTCTGCTCTGGGCTCTGTATTATGATAGAAGGGAGCCTGGG 
ACAGACC AAAT GAT G T CC T T GAAGAAT GACAAC T C T C AG CACC T G T CAT G T C CC T C AGT AGAAC T 
G T T GAAACCAAGCC T GTCAAGAAT C T T TGAACACACATC CATGGCAAACAGC TT TAAT AC ACAC T 
T T GAGAT GGAGGAGT TATAAAAAGAAATGTCACAGAAGAAAACCACAAAC T TGT T TTATT GGAC T 
T GT GAAT TT TT GAG TACATACT AT GT G T T TCAGAAATAT G T AGAAATAAAAATGT TGCCATAAAA 
TAACACCTAAGCATATAC TATTCTAT GCT T TAAAATGAGGATGGAAAAGTTTCATGTCATAAGTC 
ACC ACCT GGACAATAATTGATGCCCT TAAAATGCTGAAGACAGATGTCATACCCACTGTGTAGCC 
T GT G TAT GACT T TTAC TGAACACAGT T ATGT T T TGAGGCAGCATGGT T TGAT TAGCAT T T CCGCA 
TCCAT GCAAACGAG TCACAT AT GGTGGGACTGGAGCCAT AGTAAR.GGT TGAT TTACT T C T ACCAA 
CTAGTATATAAAGTACTAAT TAAATGCTAACATAGGAAGTTAGAAAATACTAATAACTT TTATTA 
C TCAGCGAT CTATTCT TCTGAT GCTAAATAAAT TATATATCAGAAAAC T T T CAATAT T GGTGAC T 
ACCTAAATGTGATT T T TGCT GGT TAC TAAAATAT TC T TACCACT TAAAAGAGCAAGCTAACACAT 
TGTCTTAAGCTGATCAGGGATTTTTTGTATATAAGTCTGTGTTAAATCTGTATAATTCAGTCGAT 
T TCAGT T CTGATAATGT TAAGAATAACCATTATGAAAAGGAAAAT T TG TCCT GT AT AGCATCAT T 
ATTTTTAGCCTTTCCTGTTAATAAAGCTTTACTATTCTGTCCTGGGCTTATATTACACATATAAC 
T G TTATT TAAATAC T TAACCACTAAT T TT GAAAAT TACCAGTGT GATACATAGGAATCAT TATTC 
AGAAT GT AGTCTGGT C T T TAGGAAGT ATTAATAAGAAAAT T T GCACATAACT TAG TTGAT TCAGA 
AAGGACT TGTATGC T GT T T T TCT CCCAAATGAAGACT CT T T TT GACAC TAAACAC T TT TTAAAAA. 
GCTTATCTTTGCCTTCTCCAAAGAAGAAGCAATAGTCTCCAAGTCAATATAAATTCTACAGAAAA 
TAGTGTTCTTTTTCTCCAGAAAAATGCTTGTGAGAATCATTAAAACATGTGACTyiTTTAGAGATT 
C TT TGTT TT AT T TCAC TGAT TAAT ATACTGTGGCAAATTACACAGAT TAT TAAAT T T T T TTACAA 
GAGTATAGT ATAT T TAT TT GAAAT GGGAAAAGTGCAT TT TACT G TAT T T T GT GT AT T T TGTT TAT 
T TC TC AGAATATGGAAAGAAAAT T AAAATGTGT CAAT AAAT AT T T TCTAGAGAGTAA 
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FIGURE 108 

MAREDS VKCLRCLLYALNLLFWLMS I SVIAVSAWMRDYLNNVLTLTAETRVEEAVI LT YFPWHP 
VMIAVCCFLI IVGMLGYCGTVKRNLLLLAWYFGSLLVI FCVELACGWTYEQELMVPVQWSDMVT 
LKARMTNYGLPRYRWLTHAWNFFQRErKCCGVVYFTDWLEMTEMDWPPDSCCVREFPGCSKQAHQ 
EDLSDLYQEGCGKKMYSFLRGTKQLQVLRFLGISIGVTQILAMILTITLLWALYYDRREPGTDQM 
MSL1KNDNSQHLSCPSVELLKPSLSRI FEHTSMANS FNTHFEMEEL 

Signal peptide: 

amino acids 1-33 

Transmembrane domains: 

amino acids 12-35, 57-86, 94-114, 226-248 
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FIGURE 1QQ 

CCAilGGCCAGAGCTGTGGACACCTTATCCCACTCATCCTCATCCTCTTCCTCTGATAAAGCCCCTACCAGTGCT 
GAT iiAAGT CTTTCTCGTGAGAGCCT AGAGGCCTTAAAAAAAAAAGT GCTTGAAAGAGAAGGGGACAAAGGAACA 
CC AGT AT T AAGAGGATT TT C CAGT GTT TCT GGC AGT T GGTCC AGAAG GATGCCT C CAT T CCT GCT T CT C AC CT G 
CCTCTTCATCACAGGCACCTCCGTGTCACCCGTGGCCCTAGATCCTTGTTCTGCTTACATCAGCCTGAATGAGC 
CCT GGAGGAACACTGACCACCAGTT GGATGAGTCTCAAGGTCCTCCT CTATGT GACAACC ATGTGAATGGGGAG 
T GGT ACC AC TT C AC GGGC AT GGC GGGAGAT GC CAT GCCT ACC T T CT GCAT AC C AGAAAAC C AC T GT GGAACCC A 
CGCACCTGTCTGGCTCAATGGCAGCCACCCCCTAGAAGGCGACGGCATTGTGCAACGCCAGGCTTGTGCCAGCT 
TC AAT GG GAACT GCT GT CT CT GGAAC AC C AC GGT GGAAGT C AAGGC T TGCC C T GGAGGCT ACT AT GTGT AT C GT 
CT GACCAAGCCC AGC GT CT GCT T CC AC GT CT AC T GT GGT C AT T T TT ATGAC ATCT GC GAC GAGGAC TGCCAT GG 
C AGCT GC T CAG AT AC C AGC GAGT GC AC AT GC GCT C C AGGAACT G T GC T AGGC CCT GACAG GC AGAC AT GCT TT G 
ATGAAAATGAATGTGAGCAAAACAACGGTGGCTGCAGTGAGATCTGTGTGAACCTCAAAAACTCCTACCGCTGT 
GAGTGTGGGGTTGGCCGTGT GCTAAGAAGTGATGGCAAGACT TGTGAAGACGTT GAAGGAT GCCACAAT AAC AA 
TGGTGGCTGCAGCCACTCTTGCCTTGGATCTGAGAAAGGCTACCAGTGTGAATGTCCCCGGGGCCTGGTGCTGT 
CTGAGGAT AACCACACT TGCCAAGTCCCTGT GTTGT GCAAAT CAAATGCCATTGAAGTGAAC ATCCCC AGGGAG 
CTGGTT G GT GGCCT GGAGC T CT T CCT GACC AAC AC CT CC T GC CGAGGAGT GT CCAACGGC ACCC AT GT C AAC AT 
CCT CTTCT CT CT CAAGAC AT GTGGTACAGTGGTCGATGT GGT G AAT GAC AAGAT T GT GGCC AGC AACC T CGT GA 
C AGGTCT ACCCAAGCAGACCCCGGGGAGCAGCGGGGACT TCATCAT C CGAACCAGC AAGCTGCT GATCCCGGT G 
ACCTGCGAGTT TCCACGCCT GTACACCATTTCT GAAGGAT AC GTTCCCAACCTTCGAAACTCCCCACT GGAAAT 
CATGAGCCGAAATCATGGGATCTTCCCATTCACTCTGGAGATCTTCAAGGACAATGAGTTTGAAGAGCCTTACC 
GGGAAGCTCTGCCCACCCTCAAGCTTCGTGACTCCCTCTACTTTGGCATTGAGCCCGTGGTGCACGTGAGCGGC 
TT GGAAAGCT T G GT GGAGAGCT GCT TT GCC ACCC C C ACC TCC AAGAT CGACGAGGT CCTGAAAT AC T ACCT CAT 
CCGGGATGGCT GTGT TT CAGAT GACTCGGTAAAGCAGT ACACATCCC GGGAT C ACC TAGCAAAGCACT TCCAGG 
TCCCTGTCTT C AAGT TT GT GGGC AAAGACC ACAAGGAAGT GT T T CT GC ACT GCC GG GT TCT T GT CT GT GGAGT G 
TTGGACGAGCGTTCCCGCTGTGCCCAGGGTTGCCACCGGCGAATGCGTCGTGGGGCAGGAGGAGAGGACTCAGC 
C GGT CTACAGG GCC AGAC GCT AACAGGC GGCCC GATC C GC AT CGACT GGGAG GACTAGT T CGT AGC C AT AC CT C 
GAGTCCCTGCATTGGACGGCTCTGCTCTTTGGAGCTTCTCCCCCCACCGCCCTCTAAGAACATCTGCCAACAGC 
TGGGTTCAGACTTCACACTGTGAGTTCAGACTCCCAGCACCAACTCACTCTGATTCTGGTCCATTCAGTGGGCA 
CAGGTCACAGCACTGCTGAACAATGTGGCCTGGGTGGGGTTTCATCTTTCTAGGGTTGAAAACTAAACTGTCCA 
CCCAGAAAGACACTCACCCCATTTCCCTCATTTCTTTCCTACACTTAAATACCTCGTGTATGGTGCAATCAGAC 
C AC AAAATCAG AAGC TGGGTATAAT AT TTCAAGT TACAAACCCTAGAAAAAT TAAAC AGT T ACT GAAATT AT GA 
CTTAAATACCCAATGACTCCTTAAATATGTAAATTATAGTTATACCTTGAAATTTCAATTCAAATGCAGACTAA 
TTAT AGGGAATTT GGAAGT GT ATCAAT AAAACAGT ATAT AAT TTT 



FIGURE HO 



MP P FLLLTCL FI TGT S VS PVALDFCSAY I SLNE PW RNT DHQLDE SQGP PLCDNH VNGEW Y H FTGMAGDAMP 
TFCIPENHCGTHAPVWLNGSHPLEGDGIVQRQACASFNGNCCLWNTTVEVKACPGGYYVYRLTKPSVCFHV 
YC GH FY D I CDE DC HGSC S DT S E CT C APGT VLG P DRQTC FDENEC E QNNGGC SE I CVNL KN S Y RC E C GVGR V 
LRSDGKTCEDVEGCHNNNGGCSHSCLGSEKGYQCECPRGLVLSEDNHTCQVPVLCKSNAIEVNIPRELVGG 
LELFLTNTSCRGVSNGTHVNILFSLKTCGTWDWNDKIVASNLVTGLPKQTPGSSGDFIIRTSKLLIPVT 
CEFPRLYTISEGYVPNLRNSPLEIMSRNHGIFPFTLEIFKDNEFEEPYREALPTLKLRDSLY FGIEPWHV 
SGLESLVESCFATPTSKIDEVLKYYLIRDGCVSDDSVKQYTSRDHLAKHFQVPVFKFVGBCDHKEVFLHCRV 
LVCGVLDERSRCAQGCHRRMRRGAGGEDSAGLQGQTLTGGPIRIDWED 

Important features of the protein: 
Signal peptide: 

amino acids 1-16 

N-glycosylation sites . 

amino acids 89-93, 116-120, 259-263, 291-295, 299-303 

Tyrosine kinase phosphorylation sites. 

amino acids 411-418, 443-451 

N-myristoylation sites . 

amino acids 226-232, 233-239, 240-246, 252-258, 296-302, 300-306, 
522-528, 531-537 

Aspartic acid and asparagine hydroxy lati on site. 

amino acids 197-209 

ZP domain proteins. 

amino acids 431-457 

Calcium-binding EGF-like proteins. 

amino acids 191-212, 232-253 
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FIGURE ill 



GAGAGAGGCAGC AGCTTGCT CAGC GG AC AAGGATGCTGGGCGTGAGGG ACCAAGGCCTGCCC T GC ACTCGG 
GCCT CC TCCAGC CAGT GCTGACCAGGGACTTCT GAC CTGCTGGCCAGCCAGGAC CTGTGTGGGGAGGCCCT 
CCTGCT GCCT TGGGGT GACAAT CT C AGCT CCAGGCT AC AGGGAGACCGGG AGGATC AC AG AGCC AGC ATGT 
TACAGG ATCCTGAC AGTG AT CAACCTCT GAACAGC CTCGAT GTCAAACCCCTGC GC AAACCCCGT ATC CCC 
AT GGAG ACCT T C AGAAAG GT GGGGATCCCC AT CAT C AT AGCACT ACTGAGCCTGGC GAGT AT CATC AT TGT 
GGTT GTCCT CAT CAAGGT GATT CT GGAT AAAT ACT ACTTCCT CT GCGGGC AGCCTC TCC ACT TCATCC CGA 
GGAAGC AGCT GT GT GACGG AGAGCTGGACT GT CCCT TGGGGGAG GACGAGGAGC AC TGTGTC AAGAGC TTC 
CCCGAAGGGCCTGCAGTGGCAGTCCGCCTCTCCAAGGACCGATCCACACTGCAGGTGCTGGACTCGGCCAC 
AGGGAACTGGTT CT CTGCCT GT T T CG AC AAC T TC AC AG AAGCTC TCGCTGAGAC AGCCTGTAGGC AGATGG 
GC TACAGC AGAGCT GTGG AGAT T GGCCC AG AC CAGG ATCTGGAT GTTGTTGAAATC AC AG AAAAC AGCC AG 
GAGCTTCGCATGCGGAACTCAAGTGGGCCCTGTCTCTCAGGCTCCCTGGTCTCCCTGCACTGTCTTGCCTG 
Ifl TGGGAAGAGCCTGAAGACCCCCCGTGTGGTGGGTGGGGAGGAGGCCTCTGTGGATTCTTGGCCTTGGCAGG 
|ff T CAGC ATCCAGT ACGACAAACAGCAC GTCTGT GGAGGG AGCATCCTGG ACCCCC ACTGGGTC CT C ACGGC A 

U| GCCCACTGCTTCAGGAAACATACCGATGTGTTCAACTGGAAGGTGCGGGCAGGCTCAGACAAACTGGGCAG 
* CTTCCCATCCCTGGCTGTGGCCAAGATCATCATCATTGAATTCAACCCCATGTACCCCAAAGACAATGACA 

TCGCCC TCAT GAAGCT GC AGTT CCC ACT CACT T T CTC AGGCACAGTC AGGCCCATCTGTCTGCCCTTCTTT 
GATGAGGAGCTCACTCCAGCCACCCCACTCTGGATCATTGGATGGGGCTTTACGAAGCAGAATGGAGGGAA 
GATGTC TG ACAT ACT GCTGCAGGCGT C AGT CCAGGTC ATTGACAGC AC ACGGTGCAATGC AGACGATG CGT 
ACCAGGGGGAAGTCACCG AGAAGAT G ATGTGT GCAGGCAT CCCGGAAGGG GGTG TGGACACCT GCC AGGGT 



o 



in 

Hi 



fU GACAGTGGTGGGCCCCTGATGTACCAATCTGACCAGTGGCATGTGGTGGGCATCGTTAGCTGGGGCTATGG 

CTGCGGGGGCCCGAGCACCCCAGGAGTATACACCAAGGTCTCAGCCTATCTCAACTGGATCTACAATGTCT 
GGAillGGCTGAGCTGTAATGCTGCTGCCCCTTTGCAGTGCTGGGAGCCGCTTCCTTCCTGCCCTGCCCACCT 
GGGGATCCCCCAAAGTCAGACACAGAGCAAGAGTCCCCTTGGGTACACCCCTCTGCCCACAGCCTCAGCAT 
TTCT TGGAG CAGCAAAGGGCCT CAATTC CTGT AAGAGACCCT CGC AGCCCAGAG GCGCCC AGAGGAAGTCA 
GCA(SCCCTAGCTCGGCCACACTTGGTGCTCCCAGCATCCCAGGGAGAGACACAGCCCACTGAACAAGGTCT 
CAGGGGT AT T GC TAAGCC AAGAAGGAACTTTCCC AC ACTACT GAATGG AAGC AG GCTGTCTT GT AAAAGCC 
CAGATC ACT GTGGGCTGG AGAG GAGAAG GAAAGGGT CTGC GCCAGCCCTGT CCGTC TTCACCCATCCCCAA 
GCCT ACTAGAGC AAGAAACC AGTT GT AAT AT AAAAT GC ACT G CC CT ACTGT T GGT ATGACTACCGTT ACCT 
ACTGTTGTCATTGTTATTACAGCTATGGCCACTATTATTAAAGAGCTGTGTAACATCTCTGGCAAAAAAAA 
AAAA 
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FIGURE 112 

MLQDPDSDQPLNSLDVKPLRKPRIPMETFRKVGIPIIIALLSLASIIIWVLIKVILDKYYFLCG 
QPLHFI PRKQLCDGELDCPLGEDEEHCVKS FPEGPAVAVRLSKDRS TLQVLDSATGNWFSACFDN 
FTEAIAETACRQMG YSRAVE I GPDQDLDWE I 

KTPRWGGEEASVDSWPWQVSIQYDKQHVCGGSILDPHWVLTAAHCFRKHTDVFNWKVRAGSDKL 
GSFPSLAVAKI I IIEFNPMYPKDNDIALMKLQFPLTFSGTVRPICLPFFDEKLTPATPLWI IGWG 
FTKQNGGKMSDILLQASVQVI DS TRCNADDAYQGEVTEKMMCAG I PEGGVDTCQGDS GGPLMYQS 
DQWHWG I VS WGYGCGGP S T PGVYTKVS AYLNW I YNVWKAEL 

Transmembrane domain: 

amino acids 32-53 (typell) 
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FIGURE lis 



GGCTGGACTGGAACTCCTGGTCCCAAGTGATCCACCCGCCTCAGCCTCCCAAGGTGCTGTGATTA 
TAGGTGTAAGCCACCGTGTCTGGCCTCTGAACAACTTTTTCAGCAACTAAAAAAGCCACAGGAGT 
TGAACTGCTAGGATTCTGACTATGCTGTGGTGGCTAGTGCTCCTACTCCTACCTACATTAAAATC 
TGTTTTTTGTTCTCTTGTAACTAGCCTTTACCTTCCTAACACAGAGGATCTGTCACTGTGGCTCT 
GGCCCAAACCTGACC T TCACT CTGGAACGAGAACAGAGGT T TC TACCCACACCGTCCCCTCGAAG 
CCGGGGACAGCCTCACCTTGCTGGCCTCTCGCTGGAGCAGTGCCCTCACCAACTGTCTCACGTCT 
GGAGGCACTGACTCGGGCAGTGCAGGTAGCTGAGCCTCTTGGTAGCTGCGGCTTTCAAGGTGGGC 
CTTGCCCTGGCCGTAGAAGGGATTGACAAGCCCGAAGATTTCATAGGCGATGGCTCCCACTGCCC 
AGGCATCAGCCTTGCTGTAGTCAATCACTGCCCTGGGGCCAGGACGGGCCGTGGACACCTGCTCA 
GAAGCAGTGGGTGAGACATCACGCTGCCCGCCCATCTAACCTTTTCATGTCCTGCACATCACCTG 
ATCCATGGGCTAATCTGAACTCTGTCCCAAGGAACCCAGAGCTTGAGTGAGCTGTGGCTCAGACC 
CAGAAGGGGTCTGC T TAGACCACCTGGT TTATGTGACAGGACT T GCAT TC TCCT GGAACATGAGG 
GAACGCCGGAGGAAAGCAAAGTGGCAGGGAAGGAACTTGTGCCAAATTATGGGTCAGAAAAGATG 
GAGGTGTTGGGTTATCACAAGGCATCGAGTCTCCTGCATTCAGTGGACATGTGGGGGAAGGGCTG 
CCGATGGCGCATGACACACTCGGGACTCACCTCTGGGGCCATCAGACAGCCGTTTCCGCCCCGAT 
CCACGTACCAGCTGCTGAAGGGCAACTGCAGGCCGATGCTCTCATCAGCCAGGCAGCAGCCAAAA 
TCTGCGATCACCAGCCAGGGGCAGCCGTCTGGGAAGGAGCAAGCAAAGTGACCATTTCTCCTCCC 
C TCC T TCCCTCTGAGAGGCCCT CCTAT GTCCC TACTAAAGC CACCAGCAAGACATAGC TGACAGG 
GGCTAATGGCTCAGTGTTGGCCCAGGAGGTCAGCAAGGCCTGAGAGCTGATCAGAAGGGCCTGCT 
GTGCGAACACGGAAATGCCTCCAGTAAGCACAGGCTGCAAAATCCCCAGGCAAAGGACTGTGTGG 
C TC AATT TAAATCAT GT T C TAGTAAT TGGAGC TGTCCCCAAGACCAAAGGAGCTAGAGCTT GGT T 
CAAATGATCTCCAAGGGCCCT TATACCCCAGGAGACT TTGATT TGAATT TGAAACCCCAAATCCA 
AACCTAAGAACCAGGTGGATTAAGAATCAGTTATTGCCGGGTGTGGTGGCCTGTAATGCCAACAT 
TTTGGGAGGCCGAGGCGGGTAGATCACCTGAGGTCAGGAGTTCAAGACCAGCCTGGCCAACATGG 
TGAAACCCCTGTCTCTACTAAAAATACAAAAAAACTAGCCAGGCATGGTGGTGTGTGCCTGTATC 
CCAGCTACTCGGGAGGCTGAGACAGGAGAATTACTTGAACCTGGGAGGTGAAGGAGGCTGAGACA 
GGAGAATCACTTCAGCCTGAGCAACACAGCGAGACTCTGTCTCAGAAAAAATAAAAAAAGAATTA 
TGGTTATTTGTAA 
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FIGURE 114. 



5 i 

III 
If? 

U s 

S . p 



Pi 

IK 

-55 



fit 

fii 



MLWKLVLLLLPTLKSVFCSLVTSLYLPNTEDLSLWLWPKPDLHSGTRTEVSTHTVPSKPGTASPC 
WPL^GAVPSPTVSRLEALTRAVQVAEPLGSCGFQGGPCPGRRRD 

Signal peptide: 

amino acids 1-15 
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FIGURE 115 

CAGCAGTGGTCTCTCAGTCCTCTCAAAGC^ 

AGA^TCCTCCAGAGAATTGTGAAGACTGTC^CATTCTAAATGCAGAAGCTTTTAAATCC^GAAA 
ATATGTAAATCACTTAAGATTTGTGGACTGGTGTTTGGTATCCTGGCCCTAACTCTAATTGTCCT 
GT T T T GGGGGAGCAAGCACT T CTGGC CGGAGGTACCCAAAAAAGCCTATGACAT GGAGCACACT T 
T CTACAGCAATGGAGAGAAGAAGAAGAT T TACATGGAAAT T GATCC T G T GACCAGAACT GAAATA 
T TCAGAAGCGGAAATGGCACTGATGAAACATTGGAAGTGCACGACT TTAAAAACGGATACACTGG 
CATCTACTTCGTGGGTCTTCAAAAATGTTTTATCAAAACTCAGATTAAAGTGATTCCTGAATTTT 
CTGAACCAGAAGAGGAAATAGATGAGAATGAAGAAAT TACCACAAC T T TCT T TGAACAG T CAGTG 
AT TT GGGTCCCAGCAGAAAAGCC T AT TGAAAACCGAGAT T T TCT TAAAAAT TCCAAAAT T CTGGA 
GATT TGTGATAACGTGACCATGTATT GGATCAATCCCACTCTAATATCAGTT TCTGAGTTACAAG 
ACT T T GAGGAGGAGGGAGAAGATCT T CAC T T T CC TGCCAACGAAAAAAAAGGGAT TGAACAAAAT 
GAACAGT GGGTGGTCCCTCAAGT GAAAGTAGAGAAGACC CGTCACGCCAGACAAGC AAGTGAGGA 
AGAACTT CCAATAAATGACTATACTGAAAATGGAATAGAATTTGATCCCATGCT GGAT GAGAGAG 
GTTATTGTTGTATTTACTGCCGTCGAGGCAACCGCTATTGCCGCCGCGTCTGTGAACCTTTACTA 

GGCTACT ACCCATATCCATAC T GC TACCAAGGAGGAC GAG T CAT C TGT CG TGTCATCATGCCTTG 
TMiCTGGTGGGTGGCCCGCATGCTGGGGAGGGTCTAATAGGAGGTTTGAGCTCAAATGCTTAAAC 
TGCTGGCAACATATAATAAATGCATGCTATTCAATGAATTTCTGCCTATGAGGCATCTGGCCCCT 
GGTAGCCAGCTCTCCAGAATTACTTGTAGGTAATTCCTCTCTTCATGTTCTAATAAACTTCTACA 
TTATCACCAAAAAAAAAAAAAAAAAAA 
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FIGURE 116 



MAKl^PPENCEDCHILNAEAFKSKKICKSLKICGLVFGILALTLIVLFWGSKHFWPEVPKKAYDME 
HTFiTSNGEKKKIYMEIDPVTRTEIFRSGNGTDETLEVHDFKNGYTGIYFVGLQKCFIKTQIKVIP 
EFSEPEEEIDENEEITTTFFEQSVIWVPAEKPIENRDFLKNSKILEICDNVTMYWINPTLISVSE 
LQDFEEEGEDLHFPANEKKGIEQNEQWWPQVKVEKTRHARQASEEELPINDYTENGIEFDPMLD 
ERGYCCIYCRRGNRYCRRVCEPLLGYYPYPYCYQGGRVICRVIMPCNWWVARMLGRV 

Important features of the protein: 
Signal peptide: 

amino acids 1-40 

Transmembrane domain: 

amino acids 25-47 (type II) 

N-glycosylation sites. 

amino acids 94-97, 180-183 

Glycosaminoglycan attachment sites. 

amino acids 92-95, 70-73, 85-88, 133-136, 148-151, 192-195, 239- 
242 

N-myr i s toyl a ti on si tes . 

amino acids 33-38, 95-100, 116-121, 215-220, 272-277 

Microbodies C-terminal targeting signal. 

amino acids 315-317 

Cytochrome c family heme-binding site signature. 

amino acids 9-14 
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FIGURE 117 



GAGCTCCCCTCAGGAGCGCGTTAGCTTCACACCTTCGGCAGCAGGAGGGCGGCAGCTTCTCGCAGGCGGCA 
GGGCGGGCGGCCAGGATCATGTCCACCACCACATGCCAAGTGGTGGCGTTCCTCCTGTCCATCCTGGGGCT 
GGCCGGCTGCATCGCGGCCACCGGGATGGACATGTGGAGCACCCAGGACCTGTACGACAACCCCGTCACCT 
CCGTGTTCCAGT ACGAAGGGCTCT GG AGGAGCT GCGTGAGGC AGAGTT C AGGCT TC AC CGAATGC AGGCCC 
TATTTCACCATCCTGGGACTTCCAGCCATGCTGCAGGCAGTGCGAGCCCTGATGATCGTAGGCATCGTCCT 
GGGTGCCATTGGCCTCCTGGTATCCATCTTTGCCCTGAAATGCATCCGCATTGGCAGCATGGAGGACTCTG 
3 . CC AAAGCC AACATGAC ACTGACCT CCGGG ATCAT GTTC ATTGTC TC AGGTCT T T GTGC AAT T GCTGGAGT G 

pi TCTGTGTTTGCCAACATGCTGGTGACTAACTTCTGGATGTCCACAGCTAACATGTACACCGGCATGGGTGG 
Pi GATGGTGCAGACTGTTCAGACCAGGTACACATTTGGTGCGGCTCTGTTCGTGGGCTGGGTCGCTGGAGGCC 

f|| tcac:actaattgggggtgtgatgatgtgcatcgcctgccggggcctggcaccagaagaaaccaactacaaa 

gccgtttcttatcatgcctcaggccacagtgttgcctacaagcctggaggcttcaaggccagcactggctt 
tggcjtccaacaccaaaaacaagaagatatacgatggaggtgcccgcacagaggacgaggtacaatcttatc 
111 cttccaagcacgactatgtgtaatgctctaagacctctcagcacgggcggaagaaactcccggagagctca 
^ cccaaaaaacaaggagatcccatctagatttcttcttgcttttgactcacagctggaagttagaaaagcct 
!^ cgatttcatctttggagaggccaaatggtcttagcctcagtctctgtctctaaatattccaccataaaaca 
rS, gctgagttatttatgaattagaggctatagctcacattttcaatcctctatttctttttttaaatataact 

m 

m ttctactctgatgagagaatgtggttttaatctctctctcacattttgatgatttagacagactccccctc 

rs 5 TTCCTCCTAGTCAATAAACCCATTGATGATCT ATTTCCCAGCTTATCCCCAAGAAAACTTTT GAAAGGAAA 

P| GAGTAGACCCAAAGATGTTATTTTCTGCTGTTTGAATTTTGTCTCCCCACCCCCAACTTGGCTAGTAATAA 
jff| AC ACTT ACT GAAGAAG AAGC AAT AAG AGAAAGAT AT TTGT AATC TCTCCAGCCC AT G ATCTCGGTTTTCTT 

ACACTGTGATCTTAAAAGTTACCAAACCAAAGTCATTTTCAGTTTGAGGCAACCAAACCTTTCTACTGCTG 
TTGAC ATCT T CT TATT AC AGCAACAC CATTCT AGGAGT TT CC TGAG CTCTCCAC TGGAGTCC T CTTTCTGT 
CGCGGGTCAGAAATTGTCCCTAGATGAATGAGAAAATTATTTTTTTTAATTTAAGTCCTAAATATAGTTAA 
AATAAATAAT GT T T T AGT AAAATGAT AC ACTATC TCTGTGAAAT AGCCTC ACCC CT AC ATGT GGAT AGAAG 
GAA^TGAAAAAATAATTGCTTTGACATTGTCTATATGGTACTTTGTAAAGTCATGCTTAAGTACAAATTCC 
ATGAAAAGCTCACACCTGTAATCCTAGCACTTTGGGAGGCTGAGGAGGAAGGATCACTTGAGCCCAGAAGT 
TCGAGACTAGCCTGGGCAACATGGAGAAGCCCTGTCTCTACAAAATACAGAGAGAAAAAATCAGCCAGTCA 
T GGTGGC ATACACCT GTAGTCC CAGC ATTCCGGGAGGCTG AGGT GGGAGG AT CACT TG AGCC CAGGGAGGT 
T GGGGC TGCAGT GAGC CATGAT CAC ACCACTGCACTCCAGCCAGGTGAC ATAGC GAGATCCT GTCT AAAAA 
AATAAAAAATAAATAATGGAACACAGCAAGTCCTAGGAAGTAGGTTAAAACTAATTCTTTAA 
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FIGURE 118 



MS TTTCQWAFLLS ILGLAGC I AATGMDMWS TQDLYDNPVT S VFQYEGLWRSCVRQS S G FTECRP 
YFTILGLPMLQAVRALMIVGIVL^ 

CAI AGVS VFANML VTNFWMS TANMY T GMGGMVQTVQT RYT FGAAL FVGWVAGGLTL I GGVMMC IA 
CRGIiAPEETNYKAVSYHASGHSVAYKPGGFKASTGFGSNTKNKKIYDGGARTEDEVQSYPSKHDY 
V 

Signal peptide: 

amino acids 1-23 

Transmembrane domains: 

amino acids 81-100, 121-141, 173-194 
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FIGURE HQ 



GGA2U\AAC TGTTCTCTTCTGT GGCACAGAGAACCC T GCT T CAAAGCAGAAGTAGCAGT T CC GGAGTC C 
AGCTGGCTAAAACTCATCCCAGAGGATAATGGCAACCCATGCCTTAGAAATCGCTGGGCTGTTTCTTG 
GTGGTGTTGGAATGGTGGGCACAGTGGCTGTCACTGTCATGCCTCAGTGGAGAGTGTCGGCCTTCATT 
GAAiVACAACATCGTGGTTTTTGAAAACTTCTGGGAAGGACTGTGGATGAATTGCGTGAGGGAGGCTAA 
CAT CAGGAT GCAGTGCAAAAT CTAT GATT CC CT GC T GGC T CTT T C T CC G GAC C TACAGG CAGCCAGAG 
GACTGATGTGTGCTGCTTCCGTGATGTCCTTCTTGGCTTTCATGATGGCCATCCTTGGCATGAAATGC 
AC C AGGT GCAC GGGGGACAAT GAGAAGGT GAAG GC T CACATTCT G C TGAC GGC T GGAAT C ATC T T CAT 
CAT(^ACGGGCATGGTGGTGCTCATCCCTGTGAGCTGGGTTGCCAATGCCATCATCAGAGATTTCTATA 
ACT CAATAGTGAATGTT GCCCAAAAAC GTGAGCTTGGAGAAGCTCT CTACTTAGGATGGACCACGGCA 

PI 

:^ CTGGTGCTGATTGTTGGAGGAGCTCTGTTCTGCTGCGTTTTTTGTTGCAACGAAAAGAGCAGTAGCTA 
|-3 CAGATACTCGATACCTTCCCATCGCACAACCCAAAAAAGTTATCACACCGGAAAGAAGTCACCGAGCG 
HI TCTACTCCAGAAGTCAGTATGT GTAG TTGTGTATGTTTTTTTAACTTTACTATAAAGCCATGCAAATG 
HI ACAkAAAT CTAT AT TAC TT T C T CAAAATGGAC CCC AAAGAAACTT T GAT T T AC T GT TCT TAAC T G C C T 

III AAT CTTAATTAC AGGAACT GT GCATCAGCTATT TAT GAT T CTATAAGCT ATTTCAGCAGAATGAGATA 

^ T T AAACC C AAT GC T TT GAT T G T T C TAGAAAG TATAG TAAT TT GTT T T CT AAGG T GGT T C AAGCAT CTA 

Sf CTCTTTTTATCATTTACTTCAAAATGACATTGCTAAAGACTGCATTATTTTACTACTGTAATTTCTCC 

ill 

n ACGACAT AGCAT TAT G TAC AT AGATGAGT GT AACAT T TAT ATCTC ACAT AGAGAC AT GC T TAT AT GGT 

fit TTTATTTAAAATGAAAT GCCAGTCCAT TACACT GAATAAATAGAACT CAACTATTGCTT TTCAGGGAA 

ATCATGGATAGGGTTGAAGAAGGTTAC TAT T AATTGTTTAAAAACAGCT TAGGGAT TAATGTCCTCCA 
Til TTTATAAT GAAGATTAAAATGAAGGCT TTAAT CAGCATTGTAAAGGAAATTGAATGGCTTTCT GATAT 

GCTGTTTTTTAGCCTAGGAGTTAGAAATCCTAACTTCTTTATCCTCTTCTCCCAGAGGCTTTTTTTTT 
CTTGTGTATTAAATTAACATTTTTAAAACGCAGATATTTTGTCAAGGGGCTTTGCATTCAAACTGCTT 
TTC CAGGGCTATACTCAGAAGAAAGATAAAAGTGTGATC T AAGAAAAAGT GAT GGTTTT AGGAAAGTG 
AAAATATT TTTGTTTTT GTAT TT GAAG AAGAAT GAT GCAT TTTGACAAGAAATCATATATGTATGGAT 
ATAT TTTAATAAGTATT TGAGTACAGACT T T GAGGT T TC AT CAAT ATAAAT AAAAG AGC AGAAAAAT A 
T GT C T TG G TTT T CAT T T GCT TAC CAAAAAAACAACAACAAAAAAAGT T GT CCT TT GAGAACT T CACCT 
GCT CCTAT GTGGGTAC C T GAG T CAAAAT T GT CAT TT T TGT T CT G T GAAAAATAAAT TTCCTTCT T GT A 
CCA.T T TCT GTT TAGTTT TACTAAAATC TGTAAATAC TGTATTTTTCT GT TTAT TCCAAATTTGATGAA 
ACTGACAATCCAATTTGAAAGTTTGTGTCGACGTCTGTCTAGCTTAAATGAATGTGTTCTATTTGCTT 
TATACATT TATATTAAT AAAT TGTACATTTT TCTAAT T 
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FIGURE 12Q 

MMHALEIAGLFLGGVGMVGTVAVTVMPQ 

IYDSLLALSPDLQAARGLMCAASVMSFLAFMMAILGMKCTRCTGDNEKVK^ 
MWLIPVSWVANAI IRDFYNS IVNVAQKRELGEALYLGWTTALVLIVGGALFCCVFCCNEKSSS Y 
RYS I PSHRTTQKS YHTGKKSPSVYSRSQYV 



Signal peptide: 

i . amino acids 1-17 




Transmembrane domains: 

amino acids 82-101, 118-145, 164-188 
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FIGURE 121 



GGAGAGAGGCGCGCGGGTGAAAGGCGCATTGATGCAGCCTGCGGCGGCCTCGGAGCGCGGCGGAG 
CCACHACGCTGACCACGTTCCTCTCCTCGGTCTCCTCCGCCTCCAGCTCCGCGCTGCCCGGCAGCC 
GGGAGCCATGCGACCCCAGGGCCCCGCCGCCTCCCCGCAGCGGCTCCGCGGCCTCCTGCTGCTCC 
TGCTGCTGCAGCTGCCCGCGCCGTCGAGCGCCTCTGAGATCCCCAAGGGGAAGCAAAAGGCGCAG 
CTCCGGCAGAGGGAGGTGGT GGACC T GTATAAT GGAATGTGCT TACAAGGGCCAGCAGGAG TGCC 
TGGTCGAGACGGGAGCCCTGGGGCCAATGTTATTCCGGGTACACCTGGGATCCCAGGTCGGGATG 
GAT TC AAAGGAGAAAAGGGGGAAT GT CT GAGGGAAAGCT T T GAGGAGTCC T GGACACCCAACTAC 
AAGCAGT GTTCATGGAGTTCAT TGAATTATGGCATAGATCTTGGGAAAAT TGCGGAGT GT ACATT 
TACAAAGATGCGTTCAAATAGTGCTCTAAGAGTTTTGTTCAGTGGCTCACTTCGGCTAAAATGCA 
GAA^TGCATGCTGTCAGCGTTGGTATTTCACATTCAATGGAGCTGAATGTTCAGGACCTCTTCCC 
AT TGAAGCT ATAATTTATT T GGACCAAGGAAGCCCTGAAAT GAATTCAACAAT TAATATT CATCG 
CACTTCTTCTGTGGAAGGACTTTGTGAAGGAATTGGTGCTGGATTAGTGGATGTTGCTATCTGGG 
T TGGCACT TGT TCAGAT TACCCAAAAGGAGATGCTTCTACTGGATGGAAT TCAGTTTCTCGCATC 
ATT AT TGAAGAACTAC CAAA ATAAA T GCT T TAAT T T TCAT T TGCTACC TC T TTT TT TATTATGCC 
TTGGAAT GGT TCAC TTAAAT GACAT T TTAAATAAGT T TATGTAT ACAT CT GAAT GAAAAGCAAAG 
C TAAATAT GT T TACAGACCAAAGTGT GAT TTCACACT GT T T T TAAAT C TAGCAT TAT T CAT T TT G 
C TTCAAT CAAAAGTGGTTTCAATAT TTTTTTTAGTTGGT TAGAATACTTTCTTCATAGTCACAT T 
CTCTCAACCTATAATTTGGAATATTGTTGTGGTCTTTTGTTTTTTCTCTTAGTATAGCATTTTTA 
AAAAAATATAAAAGCTACCAATCTTTGTACAATTTGTAAATGTTAAGAATTTTTTTTATATCTGT 
TAAAT AAAAAT TAT T TCCAACA 



FIGURE 122 



MRPQGPAAS PQRLRGLLLLLLLQLPAPS S AS E I PKGKQKAQLRQREWDL YNGMCLQGPAGVPGR 
DGSPGANVIPGTPGIPGRDGFKGEKGECLRESFEESWTPNYKQCSWSSLNYGIDLGKIAECTFTK 
MRSNSALRVLFSGSLRLKCRNACCQRWYFTFNGAECSGPLPIEAI I YLDQGS PEMNS T INIHRTS 
SVEGLCEGIGAGLVDVAIWVGTCSDYPKGDASTGWNSVSRI I IEELPK 

Signal peptide: 

amino acids 1-30 

Transmembrane domain: 

amino acids 195-217 
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FIGURE 123 



GCTGAGCGTGTGCGCGGTACGGGGCTCTCCTGCCTTCTGGGCTCCAACGCAGCTCTGTGGCTGAA 
CTGGGTGCTCATCACGGGAACTGCTGGGCTATGGAATACAGATGTGGCAGCTCAGGTAGCCCCAA 
ATTGCCTGGAAGAATACATCATGTTTTTCGATAAGAAGAAATTGTAGGATCCAGTTTTTTTTTTA 
ACCGCCCCCTCCCCACCCCCCAAAAAAACTGTAAAGATGCAAAAACGTAATATCCATGAAGATCC 
TATTACCTAGGAAGATTTTGATGTTTTGCTGCGAATGCGGTGTTGGGATTTATTTGTTCTTGGAG 
TGTTCTGCGTGGCTGGCAAAGAATAATGTTCCAAAATCGGTCCATCTCCCAAGGGGTCCAATTTT 
TCTTCCTGGGTGTCAGCGAGCCCTGACTCACTACAGTGCAGCTGACAGGGGCTGTCATGCAACTG 
GCCCCTAAGCCAAAGCAAAAGACCTAAGGACGACC T T TGAACAATACAAAGGATGGGT T TCAAT G 
TAATTAGGCTACTGAGCGGATCAGCTGTAGCACTGGTTATAGCCCCCACTGTCTTACTGACAATG 
■ t CTTTCTTCTGCCGAACGAGGATGCCCTAAGGGCTGTAGGTGTGAAGGCAAAATGGTATATTGTGA 
^ ATCTCAGAAATTACAGGAGATACCCTCAAGTATATCTGCTGGTTGCTTAGGTTTGTCCCTTCGCT 
CI ATAACAGCCTTCAAAAACTTAAGTATAATCAATTTAAAGGGCTCAACCAGCTCACCTGGCTATAC 
O C T T iGACCATAAC CATATCAGCAAT AT TGACGAAAATGCT T TTAATGGAATACGCAGACTCAAAGA 

m GCTGATTCTTAGTTCCAATAGAATCTCCTATTTTCTTAACAATACCTTCAGACCTGTGACAAATT 
Jll TACGGAACTTGGATCTGTCCTATAATCAGCTGCATTCTCTGGGATCTGAACAGTTTCGGGGCTTG 
fit CGG.AAGCTGCTGAGT T TACAT T TACGG T C TAAC TCCC TGAGAACCAT CCC TG T GCGAAT AT T CCA 

Wj AGAC T GCCGCAACC T GGAACT T T T GGACC TGGGATAT AACC GGATC CGAAGT TT AGCCAGGAAT G 

If! TCTTTGCTGGCATGATCAGACTCAAAGAACTTCACCTGGAGCACAATCAATTTTCCAAGCTCAAC 
III CTGGCCCTTTTTCCAAGGTTGGTCAGCCTTCAGAACCTTTACTTGCAGTGGAATAAAATCAGTGT 

CATAGGACAGACCATG TC CTGGACC TGGAGC TCCTTACAAAGGC TTGATT TATCAGGC AAT GAGA 
TCG.AAGCTTTCAGTGGACCCAGTGTTTTCCAGTGTGTCCCGAATCTGCAGCGCCTCAACCTGGAT 
Wi TCCAACAAGCTCACATTTATTGGTCAAGAGATTTTGGATTCTTGGATATCCCTCAATGACATCAG 
IH TCTTGCTGGGAATATATGGGAATGCAGCAGAAATATTTGCTCCCTTGTAAACTGGCTGAAAAGTT 
Q TTAAAGGTCTAAGGGAGAATACAATTATCTGTGCCAGTCCCAAAGAGCTGCAAGGAGTAAATGTG 
?ll ATCGATGCAGTGAAGAACTACAGCATCTGTGGCAAAAGTACTACAGAGAGGTTTGATCTGGCCAG 
1": GGCTCTCCCAAAGCCGACGTTTAAGCCCAAGCTCCCCAGGCCGAAGCATGAGAGCAAACCCCCTT 
y TGCCCCCGACGGTGGGAGCCAC AGAGCCCGGCCCAGAGACC GAT GCT GAC GC CGAGCACATC TC T 

ft! TTCCATAAAATCATCGCGGGCAGCGTGGCGCTTTTCCTGTCCGTGCTCGTCATCCTGCTGGTTAT 

CTA.CGTGTCATGGAAGCGGTACCCTGCGAGCATGAAGCAGCTGCAGCAGCGCTCCCTCATGCGAA 
GGCACAGGAAAAAGAAAAGACAG T CCCTAAAGCAAAT GACT CCCAGCACCCAGGAAT T T T ATG TA 
GAT TATAAACCCACCAACACGGAGAC CAGCGAGATGC TGCT GAAT GGGAC GGGACC CT GCACCTA 
TAA.CAAATC GGGC TCCAGGGAGTGT GAGG TATGAACCAT TGTGATAAAAAGAGC TC TT AAAAGC T 
GGGAAATAAGTGGTGCTTTATTGAACTCTGGTGACTATCAAGGGAACGCGATGCCCCCCCTCCCC 
TTCCCTCTCCCTCTCACTTTGGTGGCAAGATCCTTCCTTGTCCGTTTTAGTGCATTCATAATACT 
GGTCATTTTCCTCTCATACATAATCAACCCATTGAAATTTAAATACCACAATCAATGTGAAGCTT 
GAACT CCGGTT TAATATAATACCTATTGTATAAGACCCTTTACTGATTCCAT TAAT GT CGCATTT 
GTTTTAAGATAAAACTTCTTTCATAGGTAAAAAAAAAAA 
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FIGURE 124 

MGFNVIRLLS GSAVALVIAPTVLLTMLS SAERGCPKGCRCEGKMVYCESQKLQE I PS S I SAGCLG 
LS LRYNSLQKLKYNQFKGLNQLTWLYLDHNH I SNI DENAFNGIRRLKELI LS SNRI S YFLNNT FR 
PVTNLRNLDLSYNQLHSLGSEQFRGLRKLLSLHLRSNSLRTIPVRIFQDCRNLELLDLGYNRIRS 
LARtJVFAGMIRLKELHLEHNQFSKLNLALFPRLVSLQNLYLQWNKISVIGQTMSWTWSSLQRLDL 
SGKEIEAFSGPSVFQCVPNLQRLNLDSNKLTFIGQEILDSWISLNDISLAGNIWECSRNICSLVN 
WLKSFKGLRENTIICASPKELQGVNVIDAVKNYSICGKSTTERFDLARALPKPTFKPKLPRPKHE 
SKPPLPPTVGATEPGPETDADAEHISFHKIIAGSVALFLSVLVILLVIYVSWKRYPASMKQLQQR 
SLMRRHRKKKRQSLKQMTPSTQEFYVDYKPTNTETSEMLLNGTGPCTYNKSGSRECEV 

Important features of the protein: 
Signal peptide: 

amino acids 1-33 

Transmembrane domain : 

amino acids 420-442 

N-glycosylation sites . 

amino acids 126-129, 357-360, 496-499, 504-507 

cAMP- and cGMP-dependent protein kinase phosphorylation site* 

amino acids 465-468 

Tyrosine kinase phosphorylation site. 

amino acids 136-142 

N-myristoylation sites . 

amino acids 11-16, 33-38, 245-250, 332-337, 497-502, 507-512 
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FIGURE 125 

CCGT TATCGTCl'TGCGCTACTGCT GAATGTCCG ICCCGGAGGAGGAGGAGAGGCTT T '1' GCCGC TG 
ACCC^GAGATGGCCCCGAGCGAGCAAATTCCTACTGTCCGGCTGCGCGGCTACCGTGGCCGAGCT 
AGCilACCTTTCCCCTGGATCTCACAAAAACTCGACTCCAAATGCAAGGAGAAGCAGCTCTTGCTC 
GGTTGGGAGACGGTGCAAGAGAATCTGCCCCCTATAGGGGAATGGTGCGCACAGCCCTAGGGATC 
AT TGAAGAGGAAGGCTT TC T AAAGCTT TGGCAAGGAGT GACAC CCGCCATT TACAGACACGTAGT 
G TAT TCT GGAGGTCGAATGGTCACAT AT GAACATCTCCGAGAGGT TGTGT T TGGCAAAAGTGAAG 
ATGAGCATTATCCCCTTTGGAAATCAGTCATTGGAGGGATGATGGCTGGTGTTATTGGCCAGTTT 
T TAGCCAATCCAAC TGACCTAGTGAAGGT TCAGATGCAAATGGAAGGAAAAAGGAAACTGGAAGG 
AAAACCAT TGCGAT TTCGTGGT GT ACAT CATGCAT T TGCAAAAATC TTAGCTGAAGGAGGAATAC 
GAGGGCT T TGGGCAGGCTGGG TACCCAATATACAAAGAGCAGCACTGGTGAATATGGGAGAT T TA 
ACC AC T T AT GATACAG T GAAAC AC T AC T T GG T AT T GAAT AC AC CAC T T GAGG AC AAT ATCAT G AC 
TCACGGTTTATCAAGTTTATGTTCTGGACTGGTAGCTTCTATTCTGGGAACACCAGCCGATGTCA 
T CA^AAGCAGAATAATGAATCAACCACGAGATAAACAAGGAAGGGGAC TT T TGTAT AAAT CATCG 
ACTGACTGCTTGATTCAGGCTGTTCAAGGTGAAGGATTCATGAGTCTATATAAAGGCTTTTTACC 
ATCTTGGCTGAGAATGACCCCTTGGTCAATGGTGTTCTGGCTTACTTATGAAAAAATCAGAGAGA 
TGAGTGGAGTCAGTCCATTT TAA 
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FIGURE 126 

MSVPEEEERLLPLTQRWPRASKFLLSGCi^TVAEIATFPLDLTKTRLQMQGEAALARLGDGARES 
APYRGMVRTALG 1 1 EEEGFLKLWQGVTPAI YRHVVYSGGPMVT YEHLREVVFGKSEDEHYPLWKS 
VI GCjMMAGVI GQFLANPTDLV1CVQMQMEGKRKLEGKPLRFR I LAEGGIRGLWAGWVP 

NIQIU\ALVNMGDLTTYDTVKHYLVLNTPLEDNIMTHGLSSLCSGLVASILGTPADVIKSRIMNQP 
RDKQGRGLLYKSSTDCLIQAVQGEGFMSLYKGFLPSWLRMTPWSMVFWLTYEKIREMSGVSPF 

Transmembrane domains: 

amino acids 25-38, 130-147, 233-248 
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FIGURE 127 

CGCGGATCGGACCCAAGCAGGTCGGCGGCGGCGGCAGGAGAGCGGCCGGGCGTCAGCTCCTCGAC 
CCCCGTGTCGGGCTAGTCCAGCGAGGCGGACGGGCGGCGTGGGCCCMGGCCAGGCCCGGCATGG 
AGCGGTGGCGCGACCGGCTGGCGCTGGTGACGGGGGCCTCGGGGGGCATCGGCGCGGCCGTGGCC 
CGGGCCCTGGTCCAGCAGGGACTGAAGGTGGTGGGCTGCGCCCGCACTGTGGGCAACATCGAGGA 
GCTGGCTGCTGAATGTAAGAGTGCAGGCTACCCCGGGACTTTGATCCCCTACAGATGTGACCTAT 
CAAAT GAAGAGGACATCC TCTCCATGT TCTCAGCTAT CCGT TCTCAGCACAGCGGTGTAGAGATC 
TGCATCAACAATGCTGGCTTGGCCCGGCCTGACACCCTGCTCTCAGGCAGCACCAGTGGTTGGAA 
GGACATGTTCAATGTGAACGTGCTGGCCCTCAGCATCTGCACACGGGAAGCCTACCAGTCCATGA 
AGGAGCGGAATGTGGACGATGGGCACATCATTAACATCAATAGCATGTCTGGCCACCGAGTGTTA 
CCCCTGTCTGTGACCCACTTCTATAGTGCCACCAAGTATGCCGTCACTGCGCTGACAGAGGGACT 
GAGGCAAGAGCTTCGGGAGGCCCAGACCCACATCCGAGCCACGTGCATCTCTCCAGGTGTGGTGG 
AGACACAAT TCGCC T TCAAAC T CCACGACAAGGACCC TGAGAAGGCAGCT GCCACCTATGAGCAA 
ATGAAGTGTCTCAAACCCGAGGATGTGGCCGAGGCTGTTATCTACGTCCTCAGCACCCCCGCACA 
CAT CCAGATTGGAGACAT CCAGAT GAGGCCCACGGAGCAGGTGACCTAGT GACT G T GGGAGCTCC 
TCCTTCCCTCCCCACCCTTCATGGCTTGCCTCCTGCCTCTGGATTTTAGGTGTTGATTTCTGGAT 
CACGGGATACCACTTCCTGTCCACACCCCGACCAGGGGCTAGAAAATTTGTTTGAGATTTTTATA 
TCA.TCTTGTCAAATTGCTTCAGTTGTAAATGTGAAAAATGGGCTGGGGAAAGGAGGTGGTGTCCC 
TAA.TTGTTTTACTTGTTAACTTGTTCTTGTGCCCCTGGGCACTTGGCCTTTGTCTGCTCTCAGTG 
TCTTCCCTTTGACATGGGAAAGGAGTTGTGGCCAAAATCCCCATCTTCTTGCACCTCAACGTCTG 
TGGCTCAGGGCTGGGGTGGCAGAGGGAGGCCTTCACCTTATATCTGTGTTGTTATCCAGGGCTCC 
AGACTTCCTCCTCTGCCTGCCCCACTGCACCCTCTCCCCCTTATCTATCTCCTTCTCGGCTCCCC 
AGCCCAGTCTTGGCTTCTTGTCCCCTCCTGGGGTCATCCCTCCACTCTGACTCTGACTATGGCAG 
CAGAACACCAGGGCCTGGCCCAGT GGATTT CATGGT GATCAT TAAAAAAGAAAAAT CGCAACCAA 
AA2AAAAAAA 
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FIGURE 128 



MARPGMERWRDRLALVTGASGG IGAAVARALVQQGLKWGCARTVGN IEELAAE CKSAGYPGTL I 
PYRCDLSNEEDILSMFSAIRSQHSGVDICINNAGLARPDTLLSGSTSGWKDMFNVNVLA1SICTR 
EAYQSMKERNVDDGHIININSMSGHRVLPLSVTHFYSATKYAVTALTEGLRQELREAQTHIRATC 
ISPGWETQFAFKLHDKDPEKAAATYEQMKCLKPEDVAEAVIYVLSTPAHIQIGDIQMRPTEQVT 

Important features of the protein: 
Signal peptide: 

amino acids 1-17 

N-nryr i s toyl a t ion s i tes . 

amino acids 18-24, 21-27, 22-28, 24-30, 40-46, 90-96, 109-115, 
199-205 

Short-chain alcohol dehyrogenase * 

amino acids 30-42, 104-114 
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FIGURE 12Q 



AA.CT T CTACATGGGCC TCCT GCTGCTGGXGCTCTTCCTCAGCCTCCTGCCGGTGGCCTACACCAT 
CATGTCCCTCCCACCCTCCTTTGACTGCGGGCCGTTCAGGTGCAGAGTCTCAGTTGCCCGGGAGC 
ACCTCCCCTCCCGAGGCAGTCTGCTCAGAGGGCCTCGGCCCAGAATTCCAGTTCTGGTTTCATGC 
CAGCCTGTAAAAGGCCATGGAACTTTGGGTGAATCACCGATGCCATTTAAGAGGGTTTTCTGCCA 
GGATGGAAATGTTAGGTCGTTCTGTGTCTGCGCTGTTCATTTCAGTAGCCACCAGCCACCTGTGG 
CCGTTGAGTGCTTGAA ATGA GGAACTGAGAAAATTAATTTCTCATGTATTTTTCTCATTTATTTA 
T TAAT TT T T AAC TGATAGTT G T ACATAT T TGGGGGTACAT GT GATAT T TGGATACATGTATACAA 
TAT AT AATGAT CAAATCAGGGT AAC T GGGATAT CCATCACATCAAACAT T TAT T T TT T ATTCT T T 
T TAGACAGAGTC TCAC TC TGTCACCCAGGCTGGAG TGCAG T GGTGC CATC TCAGCTTACTGCAAC 
CTCTGCCTGCCAGGTTCAAGCGATTCTCATGCCTCCACCTCCCAAGTAGCTGGGACTACAGGCAT 
GCACCACAATGCCCAACTAATTTTTGTATTTTTAGTAGAGACGGGGTTTTGCCATGTTGCCCAGG 
CTGGCCTTGAACTCCTGGCCTCAAACAATCCACTTGCCTCGGCCTCCCAAAGTGTTATGATTACA 
GGCGTGAGCCACCGTGCCTGGCCTAAACATTTATCTTTTCTTTGTGTTGGGAACTTTGAAATTAT 
ACAATGAATTATTGTTAACTGTCATCTCCCTGCTGTGCTATGGAACACTGGGACTTCTTCCCTCT 
ATCTAACTGTATATTTGTACCAGTTAA.CCAACCGTACTTCATCCCCACTCCTCTCTATCCTTCCC 
AACCT CT GATCACC T CAT T CTACT CT C TACC T CCAT GAGAT CCACTT T TT TAGCTCCCACATGT G 
AGTAAGAAAATGCAATATTTGTCTTTCTGTGCCTGGCTTATTTCACTTAACATAATGACTTCCTG 
T TCCATCCATGT TGCT GCAAAT GACAGGAT T TCGT TCT T AAT T TCAAT TAAAATAACCACACATG 
GCAAAAA 
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FIGURE 13Q 

MGLLLLVLFLSLLPVAYTIMSLPPSFDCGPFRCRVSVAREHLPSRGSLLRGPRPRIPVLVSCQPV 
KGHGTLGESPMPFKRVFCQDGNVRSFCVCAVHFSSHQPPVAVECLK 



Important features of the protein: 
Signal peptide: 

amino acids 1-18 

N-myristoylation site. 

amino acids 86-92 

Zinc carboxypeptidases , zinc-binding region 2 signature. 

amino acids 68-79 
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FIGURE lfti 



a : 



T TCT GAAGT AACGGAAGCT ACC TT GTAT AAAGAC CTCAAC AC TGCTGACCATGATC AGCGCAGCCTGGAGC 
ATCTTCCTCATCGGGACTAAAATTGGGCTGTTCCTTCAAGTAGCACCTCTATCAGTTATGGCTAAATCCTG 
TCCATCTGTGTGT CGCTGCGAT GC GGGT TTCATT T ACT GTAATGATCGCTTT CT GAC ATCCATTCC AAC AG 
GAATACCAGAGGATGCTACAACTCTCTACCTTCAGAACAACCAAATAAATAATGCTGGGATTCCTTCAGAT 
TTGi^AAAACTTGCTGAAAGTAGAAAGAATATACCTATACCACAACAGTTTAGATGAATTTCCTACCAACCT 
CCCfiAAGT AT GT AAAAGAGT T ACAT TTGCAAGAAAATAAC AT AAGG ACTATC ACTT ATGATT CACTTT CAA 
AAAT TCCCTATCT GGAAGAAT TACATTT AG AT GAC AACTCTGTCTCTGCAGT T AGC AT AG AAGAGGGAGCA 
TTCCGAGACAGCAACTATCTCCGACTGCTTTTCCTGTCCCGTAATCACCTTAGCACAATTCCCTGGGGTTT 



n GCCCAGGACTATAGAAGAACTACGCTTGGATGATAATCGCATATCCACTATTTCATCACCATCTCTTCAAG 

GTCT CACT AGTCT AAAACGCCT GGT TCT AG AT GGAAACCTGT TGAAC AAT CATGGT TT AGGT GACAAAGTT 



w 

XX '1. 



TTCTTCAACCTAGTTAATTTGACAGAGCTGTCCCTGGTGCGGAATTCCCTGACTGCTGCACCAGTAAACCT 
|y TCCAGGCACAAACCTGAGGAAGCTTTATCTTCAAGATAACCACATCAATCGGGTGCCCCCAAATGCTTTTT 
■%M CTTATCTAAGGC AGCT CT AT CGAC TGGATATGTC C AAT AATAACCT AAGT AATT T ACCTC AGGGT ATCTTT 

fl GAT GATTTGGAC AAT ATAAC ACAACTGATTCT TC GC AACAAT CCCTGGT ATTGC GGGTGC AAGAT G AAATG 

W GGTACGTGACTGGTTACAATCACTACCTGTGAAGGTCAACGTGCGTGGGCTCATGTGCCAAGCCCCAGAAA 

3 

AGGT TCGTGGGAT GGCTATT AAGGATCT CAAT GC AG AACTGT T T GATT GT AAGGAC AGTGGGATTGT AAGC 
ACCATTCAGATAACCACTGCAATACCCAACACAGTGTATCCTGCCCAAGGACAGTGGCCAGCTCCAGTGAC 
f«s CAA^CAGCCAGATATTAAGAACCCCAAGCTCACTAAGGATCAACAAACCACAGGGAGTCCCTCAAGAAAAA 
f|| CAAT TACAAT TACT GTGAAGTCTGTC ACCTCT GAT ACC ATTC AT AT CT CTTGGAAACTTGCT CT ACCT ATG 

Q ACT GCTTTGAGAC T CAGCTGGCTT AAACTGGGCC AT AGCCCGGC ATTTGGAT CT AT AACAGAAAC AAT TGT 

f|| AACAGGGG AACGCAGTGAGT ACTT GGTC ACAGCCC T GGAGCCTGAT TC ACCCTAT AAAGT AT GCATGG TTC 

CC AT GG AAAC CAGC AACCT CTACC T ATTTGAT GAAACTCCTGT T T GTATTGAGACTGAAACT GCACCC CTT 
CGAHTGTACAAC CCT ACAACCACC CT CAATCGAGAGCAAG AGAAAGAACCTT AC AAAAACCC CAATTT ACC 
TTTGGCTGCCATCATTGGTGGGGCTGTGGCCCTGGTTACCATTGCCCTTCTTGCTTTAGTGTGTTGGTATG 
T T CATAGG AATGGAT CGCT CTT CT C AAGGAACTGTGCAT AT AGC AAAGGG AGGAGAAG AAAGGATG AC TAT 
GCAGAAGCTGGCACT AAG AAGGAC AACTCT AT CC T GGAAATC AGGG AAACTT CT TTTC AG AT GTT ACC AAT 
AAGCAATGAACCCATCTCGAAGGAGGAGTTTGTAATACACACCATATTTCCTCCTAATGGAATGAATCTGT 
AC A?\AAAC AATCAC AGTG AAAGCAGT AGT AAC CGAAGCT ACAGAGACAGTGGTATTCC AG ACT C AG AT CAC 
TCACACTCATGATGCTGAAGGACTCACAGCAGACTTGTGTTTTGGGTTTTTTAAACCTAAGGGAGGTGATG 
GT 
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FIGURE 132 

MISAAWSIFLIGTKIGLFLQVAPLSVMAKSCPSVCRCDAGFIYCNDRFLTSIPTGIPEDATTLYL 
QNNQINNAGIPSDLKNLLKVERIYLYHNSLDEFPTNLPKYVKELHLQENNIRTITYDSLSKIPYL 
EELHLDDNSVSAVSIEEGAFRDSNYLRLLFLSRNHLSTIPWGLPRTIEELRLDDNRISTISSPSL 
QGLTSLKRLVLDGNLLNNHGLGDKVFFNLVNLTELSLVRNSLTAAPVNLPGTNLRKLYLQDNHIN 
RVPPNAFSYLRQLYRLDMSNNNLSNLPQGIFDDLDNITQLILRNNPWYCGCKMKWWDWLQSLPV 
KVNVRGLMCQAPEKVRGMAIKDLNAELFDCKDSGIVSTIQITTAIPNTVYPAQGQWPAPVTKQPD 
IKNPKLTKDQQTTGS PSRKT I T I TVKSVTSDT I HI SWKLALPMTALRLSWLKLGHSPAFGS I TET 
IVTGERSEYLVTALEPDSPYKVCMVPMETSNLYLFDETPVCIETETAPLRMYNPTTTLNREQEKE 
P YKNPNLPLAAI I GGAVALVT I ALLALVCW YVHRNGS L FSRNCAYSKGRRRKDDYAEAGTKKDNS 
ILEIRETSFQMLPISNEPISKEEFVIHTIFPPNC3yiNLYKNNHSESSSNRSYRDSGIPDSDHSHS 

Important features of the protein: 
Signal peptide: 

amino acids 1-28 
Transmembrane domain: 
amino acids 531-552 
N-glycosylation sites. 

amino acids 226-229, 282-285, 296-299, 555-558, 626-629, 633-636 
Tyrosine kinase phosphorylation site, 
amino acids 515-522 
N-rayristoylation sites. 

amino acids 12-17, 172-177, 208-213, 359-364, 534-539, 556-561, 
640-645 

Amidation site. 

amino acids 567-570 
Leucine zipper pattern. 

amino acids 159-180 

Phospholipase A2 aspartic acid active site. 

amino acids 34-44 
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FIGURE iaa 

CCGTCATCCCCCTGCAGCCACCCTTCCCAGAGTCCTTTGCCCAGGCCACCCCAGGCTTCTTGGCA 
GCCCTGCCGGGCCACTTGTCTTCATGTCTGCCAGGGGGAGGTGGGAAGGAGGTGGGAGGAGGGCG 
TGCAGAGGCAGTCTGGGCTTGGCCAGAGCTCAGGGTGCTGAGCGTGTGACCAGCAGTGAGCAGAG 
GCCGGCCATGGCCAGCCTGGGGCTGCTGCTCCTGCTCTTACTGACAGCACTGCCACCGCTGTGGT 
CCTCCTCACTGCCTGGGCTGGACACTGCTGAAAGTAAAGCCACCATTGCAGACCTGATCCTGTCT 
GCGCT GGAGAGAGC CACC G T C TT C CTAGAACAGAGGC TGCCTGAAATCAACC TGGATGGCATGGT 
GGGGGTCCGAGTGCTGGAAGAGCAGCTAAAAAGTGTCCGGGAGAAGTGGGCCCAGGAGCCCCTGC 
T GC AGCC GC T GAGC CTGCGCGTG GG GAT G C T GG GGGAGAAGC T GGAGGCT GC CAT C CAGAGATC C 
C TCCACT ACCTCAAGC T GAGTGAT CC CAAGTACC TAAGAGAG T TCCAGCTGACC CT CCAGCCCGG 
GTTTTGGAAGCTCCCACATGCCTGGATCCACACTGATGCCTCCTTGGTGTACCCCACGTTCGGGC 
CCCAGGACT CATTC TCAGAGGAGAGAAGTGACG TG T GCC T GGT GCAGC TGCTGGGAACCGGGACG 
GACAGCAGCGAGCCCTGCGGCCTCTCAGACCTCTGCAGGAGCCTCATGACCAAGCCCGGCTGCTC 
AGGCTACTGCCTGTCCCACCAACTGCTCTTCTTCCTCTGGGCCAGAATGAGGGGATGCACACAGG 
GAC CACT CCAACAGAGCCAGGACT AT ATCAACC T C T T CT GC GCCAACATGATGGACTT GAACCGC 
AGAGCTGAGGCCATCGGATACGCCTACCCTACCCGGGACATCTTCATGGAAAACATCATGTTCTG 
TGGAATGGGCGGCTTCTCCGACTTCTACAAGCTCCGGTGGCTGGAGGCCATTCTCAGCTGGCAGA 
AACAGCAGGAAGGATGCT TCGGGGAGCCTGATGCTGAAGATGAAGAATTATCTAAAGCTAT TCAA 
TAT CAGCAGCAT T T T TCGAGGAGAGT GAAGAGGCGAGAAAAACAA.T TTCCAGAT TCT CGCTCTGT 
TGCTCAGGCTGGAGTACAGTGGCGCAATCTCGGCTCACTGCAACCTTTGCCTCCTGGGTTCAAGC 
AATTCTCTTGCCTCATCCTCCCGAGTAGCTGGGACTACAGGAGCGTGCCACCATACCTGGCTAAT 
TTTTATATTTTTTTAGTAGAGACAGGGTTTCATCATGTTGCTCATGCTGGTCTCGAACTCCTGAT 
CTCAAGAGATCCGCCCACCTCAGGCTCCCAAAGTGTGGGATT ATAG GTGTGAGCCACCGTGTCTG 
GCTGAAAAGCACTTTCAAAGAGACTGTGTTGAATAAAGGGCCAAGGTTCTTGCCACCCAGCACTC 
ATGGGGGCTCTCTCCCCTAGATGGCTGCTCCTCCCACAACACAGCCACAGCAGTGGCAGCCCTGG 
GTGGCTTCCTATACATCCTGGCAGAATACCCCCCAGCAAACAGAGAGCCACACCCATCCACACCG 
CCACCACCAAGCAGCCGC TGAGACGGACGGTTCCATGCCAGCT GCC TGGAGGAGGAACAGACCCC 
TTTAGTCCTCATCCCTTAGATCCTGGAGGGCACGGATCACATCCTGGGAAGAAGGCATCTGGAGG 
AT.AAGCAAAGCCACCCCGACACCCAATCTTGGAAGCCCTGAGTAGGCAGGGCCAGGGTAGGTGGG 
GGCCGGGAGGGACCCAGGTGTGAACGGATGAATAAAGTTCAACTGCAACTGAAAAAAAAAAA 
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FIGURE 134 

MSARGRWEGGGRRACRGSLGLtA^ 

TAE SKAT IADL I LSALERATVFLEQRLPE INLDGMVGVRVLEEQLKS VREKWAQEPLLQPLS LRV 
GMLGEKLEAAIQRSLHYLKLSDPKYLREFQLTLQPGFWKLPHAWIHTDASLVYPTFGPQDSFSEE 
RSDVCLVQLLGTGTDSSEPCGLSDLCRSLMTKPGCSGYCLSHQLLFFLWARMRGCTQGPLQQSQD 
Y INLFCANMMDLNRRAEAI GYAYPTRDI FMENIMFCGMGG FS DFYKLRWLEAI LS WQKQQEGCFG 
EPDMDEELSKAIQYQQHFSRRVKRREKQFPDSRSVAQAGVQWRNLGSLQPLPPGFKQFSCLILP 
SSWDYRSVPPYLANFYIFLVETGFHHVAHAGLELLISRDPPTSGSQSVGL 

Important features of the protein: 
Signal peptide: 

amino acids 1-26 

Transmembrane domain: 

amino acids 39-56 

Tyrosine kinase phosphorylation sites. 

amino acids 149-156, 274-282 

N-myristoylation sites . 

amino acids 10-16, 20-26, 63-69, 208-214 

Amidation site. 

amino acids 10-14 

Glycoprotein hormones beta chain signature 1. 

amino acids 230-237 
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FIGURE 135 



GGT CTGAGT GCAGAGC TGCT GT CATGGCGGCCGCTCT GT GGGGCT TCI'TT CCCGI'CCT GCI'GC TG 
CTGCTGCTATCGGGGGATGTCCAGAGCTCGGAGGTGCCCGGGGCTGCTGCTGAGGGATCGGGAGG 
GAGTGGGGTCGGCATAGGAGAT CGCT TCAAGAT TGAGGGGCGT GCAGTTGTTCCAGGGGT GAAGC 
CTCAGGACTGGATCTCGGCGGCCCGAGTGCTGGTAGACGGAGAAGAGCACGTCGGTTTCCTTAAG 
ACAGATGGGAGTTTTGTGGTTCATGATATACCTTCTGGATCTTATGTAGTGGAAGTTGTATCTCC 
AGCTTACAGAT T TGATCCCGTTCGAGTGGATATCACTTCGAAAGGAAAAATGAGAGCAAGATATG 
TGAR.T TACAT CAAAACAT CAGAGGTT GT CAGAC TGCC CTAT CC TCTCCAAATGAAATCT TCAGGT 
CCACCTTCTTACTTTATTAAAAGGGAATCGTGGGGCTGGACAGACTTTCTAATGAACCCAATGGT 
TATGATGATGGTTCTTCCTTTATTGATATTTGTGCTTCTGCCTAAAGTGGTCAACACAAGTGATC 
C TGACATGAGACGGGAAATGGAGCAGTCAATGAATATGCTGAAT TCCAACCATGAGT TGCCTGAT 
G TT T CTGAGTTCATGACAAGACT CT T CTC T TCAAAAT CATC TGGCAAATC TAGCAGCGGCAGCAG 
T AAAACAGGCAAAAGT GGGGCT GGCAAAAGGAGGTAGTC AGGC CGTCCAGAGCT GGCAT T T GCAC 
AAA.CACGGCAACAC TGGG TGGCAT CCAAG TC T TGGAAAACCGT G TGAAGCAACT ACTATAAACT T 
GAGTCATCCCGACGTTGATCTCTTACAACTGTGTATGTT 

AACTTTTTAGCACATGTTTTGTACTTGGTACACGAGAAAACCCAGCTTTCATCTTTTGTCTGTAT 
GAGGTCAAT AT TGATGTCACTGAAT TAAT TACAGTGT CCTATAGAAAATGCCATTAATAAATTAT 
ATG AACT AC TATACAT TATGTATAT TAAT TAAAACAT CTTAAT CCAGAAATCAAAAAAAAAA&AA 
AAAAAAAAAAAAAAA 



FIGURE 1*6 

MAA^WGFFPVLLLLLLSGDVQSSSVPGAAAEGSGGSGVGIGDRFKIKGRAWPGVKPQDWISAA 
RVLVDGEEHVGFLKTDGSFVVHDIPSGSYWEW 
VVRLPYPLQMKSSGPPSYFIKRESWGWTDFIJYINPMVM^ 
QSMNMLNSNHELPDVSEFMTRLFSSKSSGKSSSGSSKTGKSGAGKRR 

Important features of the protein: 
Signal sequence: 

amino acids 1-23 

Transmembrane domain: 

amino acids 161-182 

N-glycosylation site. 

amino acids 184-187 

Glycosaminoglycan attachment sites. 

amino acids 37-40, 236-239 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 151-154 

N-myristoylation sites. 

amino acids 33-38, 36-41, 38-44, 229-234 

Amidation site. 

amino acids 238-241 

ATP/GTP-binding site motif A (P-loop) . 

amino acids 229-236 
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FIGURE 137 

GATGGCGCAGCCACAGCTTCTGTGAGATTCGATTTCTCCCCAGTTCCCCTGTGGGTCTGAGGGGA 
CCAGAAGGGTGAGCTACGTTGGCTTTCTGGAAGGGGAGGCTATATGCGTCAATTCCCCAAAACAA 

GTTTTGACATTTCCCCTGAAATGTCATTCTCTATCTATTCACTGCAAGTGCCTGCTGTTCCAGGC 
CTTACCTGCTGGGCACTAACGGCGGAGCCAGGATGGGGACAGAATAAAGGAGCCACGACCTGTGC 
CACt^AACTCGCACTCAGACTCTGAACTCAGACCTGAAATCTTCTCTTCACGGGAGGCTTGGCAGT 
TTTTCTTACTCCTGTGGTCTCCAGATTTCAGGCCTAAGATGAAAGCCTCTAGTCTTGCCTTCAGC 
CTTCTCTCTGCTGCGTTTTATCTCCTATGGACTCCTTCCACTGGACTGAAGACACTCAATTTGGG 
AAG C T G T G T GAT C GC CACAAAC C T TC AGGAAATAC GAAAT GGAT T T T C T GAGATAC GG G G C AGT G 
TGC^AGCCAAAGATGGAAACATTGACATCAGAATCTTAAGGAGGACTGAGTCTTTGCAAGACACA 
AAGCCTGCGAATCGATGCTGCCTCCTGCGCCATTTGCTAAGACTCTATCTGGACAGGGTATTTAA 
AAACTACCAGACCCCTGACCATTATACTCTCCGGAAGATCAGCAGCCTCGCCAATTCCTTTCTTA 
CCAT CAAGAAGGACC TCCGGCT CTCT CAT GCCCACAT GACAT GCCAT TGTGGGGAGGAAGCAATG 
AAGAAATACAGCCAGATTCTGAGTCACTTTGAAAAGCTGGAACCTCA.GGCAGCAGTTGTGAAGGC 
T TT GGGGGAACTAGACAT T CT T CT GCAATGGAT GGAGGAGACAGA ATAG GAGGAAAG T GATGCTG 
C TGC T AAGAATAT T CGAGGT CAAGAGCTC CAGT CT TCAATACCTGCAGAGGAGGCATGAC CCCAA 
ACCACCATCTCTTTACTGTACTAGTCTTGTGCTGGTCACAGTGTATCTTATTTATGCATTACTTG 
C T T CCTT GCATGAT TGTC T TTATGCATCCCCAATC T T AAT T GAGACGATACT TGTATAAGAT TT T 
TGT.AATATCTTTCTGCTATTGGATATATTTATTAGTTAATATATTTATTTATTTTTTGCTATTTA 
ATGTATTTAT TTTTTTACTTGGACAT GAAACT TTAAAAAAATTCACAGATTATAT TTATAACCTG 
ACTAGAGCAGGTGATGTAT T T T TATACAG TAAAAAAAAAAAACC TT GTAAAT TCTAGAAGAGTGG 
C TAGGGGGGT TATTCATTTGTAT TCAACTAAGGACATAT T TACTCATGCTGATGCTCTGTGAGAT 
ATTTGAAATTGAACCAATGACTACTTAGGATGGGTTGTGGAATAAGTTTTGATGTGGAATTGCAC 
ATCTACCTTACAATTACTGACCATCCCCAGTAGACTCCCCAGTCCCATAATTGTGTATCTTCCAG 
CCAGGAATCCTACACGGCCAGCATGTATTTCTACAAATAAAGTTTTCTTTGCATACCAAAAAAAA 
AAAAAAAAAAA 
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FIGURE 138 

MRQFPKTSFDISPEMSFSIYSLQVPAWGLTCWALTAEPGWGQNKGATTCATNSHSDSELRPEIF 

SSRI^WQFFLLLWSPDFRPKMKJISSIAFSLLSAAFYLLWTPSTGLKTLNLGSCVIATNLQEIRNG 

FSE I RGS VQAKDGNI DI RI LRRTE SLQDTKPANRCCLLRHLLRLYLDRVFKNYQTPDHYT LRKI S 

SLAMSFLTIKKDLRLSHMMTCHCGEEAI^K^ 

E 

Important features of the protein: 
Signal peptide: 

amino acids 1-42 

cAMP- and cGMP-dependent protein kinase phosphorylation sites. 

amino acids 192-195, 225-228 

N-myristoylation sites . 

amino acids 42-47, 46-51, 136-141 
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FIGURE 13Q 



CCTGGAGCCGGAAGCGCGGCTGCAGCAGGGCGAGGCTCCAGGTGGGGTCGGTTCCGCATCCAGCC 
TAGCGTGTCCACG ATGC GGCTGGGCTCCGGGACTTTCGCTACCTGTTGCGTAGCGATCGAGGTGC 
TAGGGATCGCGGTCTTCCTTCGGGGATTCTTCCCGGCTCCCGTTCGTTCCTCTGCCAGAGCGGAA 
CACGGAGCGGAGCCCCCAGCGCCCGAACCCTCGGCTGGAGCCAGTTCTAACTGGACCACGCTGCC 
ACCACCT CTCT TCAGTAAAGT T G T TATT GTTCTGATAGAT GCCT TGAGAGATGAT TT T G T G TT TG 
GGTCAAAGGGTGTGAAATTTATGCCCTACACAACTTACCTTGTGGAAAAAGGAGCATCTCACAGT 
T T T G TGGCT GAAGCAAAGCCACCTACAGT TAC TATGCCTCGAATCAAGGCAT T GAT GACGGGGAG 
CCTTCCTGGCTTTGTCGACGTCATCAGGAACCTCAATTCTCCTGCACTGCTGGAAGACAGTGTGA 
TAAGACAAGCAAAAGCAGC T GGAAAAAGAATAGT C T T T TAT GGAGAT GAAACCT GGGT TAAAT TA 
T TC CCAAAGCAT T T T G TGGAATATGATGGAACAAC CT CAT TT T TCGTG TCAGAT TACACAGAGGT 
y~ GGATAAT AATGTCACGAGGCAT TT GGATAAAGTAT TAAAAAGAGGAGATT GGGACATATT AATCC 

pi TCCACTACCTGGGGCTGGACCACATTGGCCACATTTCAGGGCCCAACAGCCCCCTGATTGGGCAG 
S AAGCTGAGCGAGATGGACAGCGTGCTGATGAAGATCCACACCTCACTGCAGTCGAAGGAGAGAGA 
|f GACGCCTTTACCCAATTTGCTGGTTCTTTGTGGTGACCATGGCATGTCTGAAACAGGAAGTCACG 
CM GGGCCTCCTCCACCGAGGAGGTGAATACACCTCTGATTTTAATCAGTTCTGCGTTTGAAAGGAAA 

CCCGGTGATATCCGACATCCAAAGCACGTCCA ATAGA CGGATGTGGCTGCGACACTGGCGATAGC 
Iff ACT TGGCT TACCGATTCCAAAAGACAGTGTAGGGAGCCTCCTATTCCCAGTTGTGGAAGGAAGAC 

ijp CAATGAGAGAGCAGT TGAGATT T TTACAT TTGAATACAGTGCAGCTTAGTAAACTGTTGCAAGAG 

7 1 AAT G TGC CGT CATATGAAAAAGAT CCTGGGTT TGAGCAGT TTAAAATGTCAGAAAGAT TGCATGG 

W GAACTGGATCAGACTGTACTTGGAGGAAAAGCATTCAGAAGTCCTATTCAACCTGGGCTCCAAGG 
s TTCTCAGGCAGTACCTGGATGCTCTGAAGACGCTGAGCTTGTCCCTGAGTGCACAAGTGGCCCAG 
Q TTCTCACCCTGCTCCTGCTCAGCGTCCCACAGGCACTGCACAGAAAGGCTGAGCTGGAAGTCCCA 
m CTGTCATCTCCTGGGTTTTCTCTGCTCTTTTATTTGGTGATCCTGGTTCTTTCGGCCGTTCACGT 
r s | CATTGTGTGCACCTCAGCTGAAAGTTCGTGCTACTTCTGTGGCCTCTCGTGGCTGGCGGCAGGCT 
U GCCTTTCGTTTACCAGACTCTGGTTGAACACCTGGTGTGTGCCAAGTGCTGGCAGTGCCCTGGAC 
fll AGGGGGCCTCAGGGAAGGACGTGGAGCAGCCTTATCCCAGGCCTCTGGGTGTCCCGACACAGGTG 
r! TTCACATCTGTGCTGTCAGGTCAGATGCCTCAGTTCTTGGAAAGCTAGGTTCCTGCGACTGTTAC 
fip CAAGGTGATTGTAAAGAGCTGGCGGTCACAGAGGAACAAGCCCCCCAGCTGAGGGGGTGTGTGAA 
*** TCGGACAGCCTCCCAGCAGAGGTGTGGGAGCTGCAGCTGAGGGAAGAAGAGACAATCGGCCTGGA 

CAC TCAGGAGGGTCAAAAGGAGACTT GGTCGCACCACTCAT CCTGCCACCCCCAGAATGCATCCT 
GCCTCATCAGGTCCAGATTTCTTTCCAAGGCGGACGTTTTCTGTTGGAATTCTTAGTCCTTGGCC 
TCGGACACCTTCATTCGTTAGCTGGGGAGTGGTGGTGAGGCAGTGAAGAAGAGGCGGATGGTCAC 
ACT CAGATCCACAGAGCCCAGGATCAAGGGACCCACT GCAGTGGCAGCAGGACT GTTGGGCCCCC 
ACCCCAACCCTGCACAGCCCTCATCCCCTCTTGGCTTGAGCCGTCAGAGGCCCTGTGCTGAGTGT 
C TGACCGAGACACT CACAGCT T TGTCATCAGGGCACAGGCT TCCTCGGAGCCAGGAT GAT CTG TG 
CCACGCTTGCACCTCGGGCCCATCTGGGCTCATGCTCTCTCTCCTGCTATTGAATTAGTACCTAG 
CTGCACACAGTATGTAGTTACCAAAAGAATAAACGGCAATAATTGAGAAAAAAAA 
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FIGURE 14Q 

MRLGSGTFATCCVAIEVLGIAVFLRGFFPAPVRSSARAEHGAEPPAPEPSAGASSNWTTLPPPLF 
SKWIVLIDALRDDFVFGSKGVKFMPYTTYL^^ 

VDVIRNLNSPALLEDSVIRQAKAAGKRIVFYGDETWVKLFPKHFVEYDGTTSFFVSDYTEVDNNV 
TRHLDKVLKRGDWDILILHYLGLDHIGHISGPNSPLIGQKLSEMDSVLMKIHTSLQSKERETPLP 
NLLVLCGDHGMSETGSHGAS S TEE VNTPL I L I S S AFERKPGDIRHPKHVQ 

Important features of the protein: 
Signal peptide: 

amino acids 1-34 

Transmembrane domain: 

amino acids 58-76 

N-glycosylation sites. 

amino acids 56-60, 194-198 

N-myr i s toyl a ti on s i tes . 

amino acids 6-12, 52-58, 100-106, 125-131, 233-239, 270-276, 
275-281, 278-284 

Amidation site. 

amino acids 154-158 

Cell attachment sequence. 

amino acids 205-208 
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FIGURE ldi 



GGCACGAGGCAAGCCTTCCAGGTTATCGTGACGCACCTTGAAAGTCTGAGAGCTACTGCCCTACA 
GAAAGTTACTAGTGCCCTAAAGCTGGCGCTGGCACTGATGTTACTGCTGCTGTTGGAGTACAACT 
TCCC TATAGAAAACAACT GCCAGCAC CT TAAGACCACTCACACCT TCAGAGT GAAGAACTTAAAC 
CCGAAGAAATTCAGCATTCATGACCAGGATCACAAAGTACTGGTCCTGGACTCTGGGAATCTCAT 
AGCAGTTCCAGATAAAAACTACATACGCCCAGAGATCTTCTTTGCATTAGCCTCATCCTTGAGCT 
CAGCCTCTGCGGAGAAAGGAAGTCCGATTCTCCTGGGGGTCTCTAAAGGGGAGTTTTGTCTCTAC 
TGT GACAAGGATAAAG GACAAAG T CAT CC ATCC C T T C AG C T GAAGAAG GAGAAAC T GATGAAGC T 
III GGCTGCCCAAAAGGAATCAGCACGCCGGCCCTTCATCTTTTATAGGGCTCAGGTGGGCTCCTGGA 
fl ACA.TGCTGGAGTCGGCGGCTCACCCCGGATGGTTCATCTGCACCTCCTGCAATTGTAATGAGCCT 
tn G TT GGGGTGACAGATAAAT T TGAGAACAGGAAACACAT T GAAT TT TCATT TCAACCAGTTT GCAA 

*Jf AGCTGAAATGAGCCCCAGTGAGGTCAGCGAT TAGG AAACTGCCCCATTGAACGCCTTCCTCGCTA 

ill 
If! 



Ul 



rsi 

5 nr.' 



ATTTGAACTAATTGTATAAAAACACCAAACCTGCTCACT 
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FIGURE 142 

MLLLLLEYNFPIENNCQHLKTTHTFRVKNLNPKKFSIHDQDHPCVLVLDSGNLIAVPDKNYIRPEI 
FFALAS SLSSASAEKGS P I LLGVSKGE FCLYCDKDKGQSHPSLQLKKEKLMKLAAQKE SARRP FI 
FYFAQVGSWNMLESAAHPGWFICTSCNCNEPVGVTDKFENRKHIEFSFQPVCKAEMSPSEVSD 



1ST 



cAMP- and cGMP-dependent protein kinase phosphorylation site, 

amino acids 33-36 



N-myristoylation site. 

amino acids 50-55, 87-92 



amino acids 37-182 



^ Interleukin-1 

y 1 

m 

5 , 5 



Irl 
ci 
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li! C TC CT GAACAAGACCT GC T T GGAGGGAT TCCAC TGAGT GAAACCCAC TCACAGGCTTGTCCATGT 
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FIGURE 143 



C TAGAGAGT ATAGGGCAGAAGGATGGCAGATGAGTGACT CCACAT CCAGAGC TGCCTCCCT T TAA 
TCCAGGATCCTGTCCTTCCTGTCCTGTAGGAGTGCCTGTTGCCAGTGTGGGGTGAGACAAGTTTG 
TCCCACAGGGCTGTCTGAGCAGATAAGATTAAGGGCTGGGTCTGTGCTCAATTAACTCCTGTGGG 
CACGGGGGCTGGGAAGAGCAAAGTCAGCGGTGCCTACAGTCAGCACCATGCTGGGCCTGCCGTGG 
AAGGGAGGTCTGTCCTGGGCGCTGCTGCTGCTTCTCTTAGGCTCCCAGATCCTGCTGATCTATGC 
CTGGCATTTCCACGAGCAAAGGGACTGTGATGAACACAATGTCATGGCTCGTTACCTCCCTGCCA 
CAGTGGAGTTTGCTGTCCACACATTCAACCAACAGAGCAAGGACTACTATGCCTACAGACTGGGG 
CACATCTTGAATTCCTGGAAGGAGCAGGTGGAGTCCAAGACTGTATTCTCAATGGAGCTACTGCT 
GGG GAGAACTAGGTGTGGGAAAT TT GAAGACGACAT TGACAACTGCCAT T TCCAAGAAAGCACAG 
AGCTGAACAATACTTTCACCTGCTTCTTCACCATCAGCACCAGGCCCTGGATGACTCAGTTCAGC 



GCTGCTCCCACATTCCGTGGACATCAGCACTACTCTCCTGAGGACTCTTCAGTGGCTGAGCAGCT 
TTGGACTTGTTTGTTATCCTATTTTGCATGTGTTTGAGATCTCAGATCAGTGTTTTAGAAAATCC 
ACACATCT T GAGCCTAATCATGTAGT GTAGATCAT TAAACATCAGCAT T T TAAGAAAAAAAAAAA 



9 

n AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 



2 I: ! 
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FIGURE 144 

MLGLPWKGGLSWALLLLLLGSQILLIYAWHFHEQRDCDEHNVMARYLPATVEFAVHTFNQQSKDY 
YAYRLGHILNSWKEQVESKTVFSMELLLGRTRCGKFEDDIDNCHFQESTELNNT FTCFFTISTRP 
WMTQFSLLNKTCLEGFH 

Important features of the protein: 
Signal peptide: 

amino acids 1-25 

N-glycosylation sites. 

amino acids 117-121, 139-143 

N-miyxistoylation site. 

amino acids 9-15 
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FIGURE 145 

CT G'i'GCAGCTCGAGGC TCCAGAGGCACACTCCAGAGAGAGCCAAGGT TCT GACGCGATGAGGAAG 
CACCTGAGCTGGTGGTGGCTGGCCACTGTCTGCATGCTGCTCTTCAGCCACCTCTCTGCGGTCCA 
GACGAGGGGCATCAAGCACAGAATCAAGTGGAACCGGAAGGCCCTGCCCAGCACTGCCCAGATCA 
CTGAGGC CCAGGTGGCT GAGAACCGCCCGGGAGCC T TCATCAAGCAAGGCCGCAAGCT CGACAT T 
GACT T CGGAGCCGAGGGCAACAGGTACTACGAGGCCAACTACT GGCAGT TCCCCGAT GGCATCCA 
CTACAACGGCTGCTCTGAGGCTAATGTGACCAAGGAGGCATTTGTCACCGGCTGCATCAATGCCA 
CCCAGGCGGCGAACCAGGGGGAGTTCCAGAAGCCAGACAACAAGCTCCACCAGCAGGTGCTCTGG 
CGGCTGGTCCAGGAGCTCTGCTCCCTCAAGCATTGCGAGTTTTGGTTGGAGAGGGGCGCAGGACT 
TCGGGTCACCATGCACCAGCCAGTGCTCCTCTGCCTTCTGGCTTTGATCTGGCTCATGGTGAAAT 
AAG CT TGCCAGGAGGC TGGCAGTACAGAGCGCAGCAGCGAGCAAAT CC TGGCAAG T GACCCAGCT 
CTTCTCCCCCAAACCCACGCGTGTTCTGAAGGTGCCCAGGAGCGGCGATGCACTCGCACTGCAAA 
TGCCGCTCCCACGTATGCGCCCTGGTATGTGCCTGCGTTCTGATAGATGGGGGACTGTGGCTTCT 
CCGTCACTCCATTCTCAGCCCCTAGCAGAGCGTCTGGCACACTAGATTAGTAGTAAATGCTTGAT 
GAGAAGAACACATCAGGCACTGCGCCACC TGCT TCACAGTACT TCCCAACAACT C T TAGAGGTAG 
GTGTATT CCCGT TT TACAGATAAGGAAAC TGAGGCCCAGAGAGCTGAAGTACTGCACC CAGCAT C 
ACCAGCTAGAAAGTGGCAGAGCCAGGATTCAACCCTGGCTTGTCTAACCCCAGGTTTTCTGCTCT 
GTCCAAT T C CAGAGC T GT CT GGTGAT CACT TTATGTCTCACAGGGACCCACATC CAAACATGTAT 
CTC TAAT GAAAT T GT GAAAGC T CCAT GT T TAGAAATAAATGAAAACACC T GA 
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FIGURE ld6 

MRKHLS WWWLAT VCMLL FSHLS AVQTRGI KHR I KWNRKALP S T AQ I T EAQVAENRPGAF I KQGRK 
LDIDFGAEGNRYYEANYWQFPDGIHYNGCSEAJWTKEAFVTGCINATQAANQGEFQKPDNKLHQQ 
VLWRLVQELCSLKHCEFWLERGAGLRVTMHQPVLLCLLALIWLMVK 

Important features of the protein: 
Signal peptide: 

amino acids 1-26 

Transmembrane domain: 

amino acids 157-171 

N-glycosylation sites, 

amino acids 98-102, 110-114 

Tyrosine kinase phosphorylation site. 

amino acids 76-83 

N-myristoylation sites. 

amino acids 71-77, 88-94, 93-99, 107-113, 154-160 

Amidation site. 

amino acids 62-66 
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FIGURE 147 



H 3 I 

in 
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HI 



GCCTTGGCCTCCCAAAGGGCTGGGATTATAGGCGTGACCACCATGTCTGGTCCAGAGTCTCATTT 
CCTGATGATTTATAGACTGAAAGAAAACTCATGTTCAGAAGCTCTCTTCTCTTCTGGCCTCCTCT 
CTGTCTTCTTTCCCTCTTTCTTCTTATTTTAATTAGTAGCATCTACTCAGAGTCATGCAAGCTGG 
AAATCTTTCATTTTGCTTGTCAGTGGGGTAGGTCACTGAGTCTTAGTTTTTATTTTTTGAAATTT 
CAACT T T CAGAT TCAGGGGGTACATGTGAAGGT TT G TTT TAT GAGTATAT TGCATGAT GCTGAGG 
TTTGGGGT 
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FIGURE 148 

MFRSSLLFWPPLCLLSLFLLILISSIYSESCKLEIFHFACQWGRSLSLSFYFLKFQLSDSGGTCE 
GLF^EYIA 

Important features of the protein: 
Signal peptide: 

amino acids 1-25 

N-myristoylation site. 

amino acids 62-68 
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FIGURE l&Q 

GTCTCCGCGTCACAGGAACTTCAGCACCCACAGGGCGGACAGCGCTCCCCTCTACCTGGAGACTTGAC 
TCCCGCGCGCCCCAACCCTGCTTATCCCTTGACCGTCGAGTGTCAGAGATCCTGCAGCCGCCCAGTCC 
CGGCCCCTCTCCCGCCCCACACCCACCCTCCTGGCTCTTCCTGTTTTTACTCCTCCTTTTCATTCATA 
ACAiULAG C T ACAGCT C C AGGAGC CC AG CG CC GG GC T GTGAC CCAAGCC GAGCG T GGAAG.AATGGG GT T 
CCTCGGGACCGGCACTTGGATTCTGGTGTTAGTGCTCCCGATTCAAGCTTTCCCCAAACCTGGAGGAA 
GCCAAGACAAAT CTCTACATAATAGAGAAT TAAGTGCAGAAAGACCTTT GAATGAACAGATTGCTGAA 
GCAC5AAGAAGACAAGATTAAAAAAA.CATATCCTCCAGAAAACAAGCCAGGTCAGAGCAACTATTCTTT 
TGTTGATAACTTGAACCTGCTAAAGGCAATAACAGAAAAGGAAAAAATTGAGAAAGAAAGACAATCTA 
TAAGAAGCTCCC CACT T GATAATAAGT T GAATGTGGAAGATGT TGATTCAACCAAGAAT CGAAAACTG 
AT CGATGAT TAT GACT C T AC T AAGAGT GGAT TGGAT CATAAATTT CAA.GAT GAT CCAGATGGT CT TC A 
TCAACTAGACGGGACTC CT TTAACCGCTGAAGACAT TGT C CATAAAAT CGCTGCCAGGATTTATGAAG 
AAAiVT GAC AGAGC CGT GTT T GACAAGATT GT TT CT AAAC T ACT T AATC T C GG C CT TAT C ACAGAAAG C 
CAAGCACATACACTGGAAGATGAAGTAGCAGAGGTTTTACAAAAATTAATCTCAAAGGAAGCCAACAA 
TTATGAGGAGGATCCCAATAA.GCCCACAAGCTGGACTGAGAAT CAGGCT GGAAAAATAC CAGAGAAAG 
TGACTCCAATGGCAGCAATTCAAGATGGTCTTGCTAAGGGAGAAAACGATGAAACAGTATCTAACACA 
TT AkCCTTGACAAATGGCTT GGAAAGGAGAACTAAAACC TACAGT GAAGACAACTTTGAGGAACT CCA 
ATATTTCCCAAATTTCTATGCGCTACTGAAAAGTATTGATTCAGAAAAAGAAGCAAAAGAGAAAGAAA 
CACTGATTACTATCAT GAAAACACT GAT T GACT TTGT GAAGAT GAT GGT GAAATAT GGAACAATATCT 
CCAGAAGAAGGTGTTTCCTACCTTGAAAACTTGGATGAAATGATTGCTCTTCAGACCAAAAACAAGCT 
AGA^AAAAATG C T ACT GACAATATAAG CAAG CT T T T CCCAGCAC CAT CAGAGAAGAGT C AT GAAGAAA 
CAGACAGT ACCAAGGAAGAAGCAGCTAAGAT GGAAAAGGAATATGGAAGCTTGAAGGAT TCCACAAAA 
GATGATAACTCCAACCCAGGAGGAAAGACAGATGAACCCAAAGGAAAAACAGAAGCCTATTTGGAAGC 
CATCAGAAAAAATATTGAATGGTTGAAGAAACATGACAAAAAGGGAAATAAAGAAGATTATGACCTTT 
CAAA.GAT GAGAGACTT CAT CAATAAACAAGC T GATGCTTATGTGGAGAAAGGCATCCTT GACAAGGAA 
GAAGCCGAGGCCATC^GCGCATTTATAGCAGCCTG TAAA AATGGCAAAAGATCCAGGAGTCTTTCAA 
CT G T T T CAGAAAACAT AAT AT AGC TTAAAACACTT C TAAT TCT GT GAT T AA&AT T T TT T GACC CAAG G 
GTTAT TAGAAAGT GCTGAAT T TACAGT AGT TAACCT TTTACAAGT GGTTAAAACATAGC TTTCT TCCC 
GTAAAAACTATCTGAAAGTAAAGTTGTATGTAAGCTGAAAAAAAAAAAAAAAAAAAA 
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FIGURE ISO 



MGFLGTGTWILVLVLPIQAFPKPGGSQDKSLHNRELSAERPLNEQIAE1AEEDKIKKTYPPENKPG 
QSNYSFVDNLNLLKAITEKEKIEKERQSIRSSPLDNKLNVEDVDSTKNRKLIDDYDSTKSGLDHK 

FQDDPDGLHQLDGTFLTAED I VHKIAAR I YEENDRAVFDKIVSKLLNLGL I TE SQAHTLEDEVAE 
VLQKLISKEANNYEEDPNKPTSWTENQAGKIPEKVTPMAAIQDGLAKGENDETVSNTLTLTNGLE 
RRTKTYSEDNFEELQYFPNFYALLKS IDSEKEAKEKETL I T IMKTL IDFVKMMVKYGT I S PEEGV 
S YLENLDEMI ALQTKNKLEKNATDNI SKLFPAP S EKSHEETDSTKEEAAKMEKE YGS LKDS TKDD 
NSNPGGKTDEPKGKTEAYLEAIRKNIEWLKKHDKKGNKEDYDLSKMRDFINKQADAYVEKGILDK 

EEASAIKRIYSSL 

N-glycosylation sites: 

amino acids 68-71, 346-349, 350-353 

Casein kinase II phosphorylation site: 

amino acids 70-73, 82-85, 97-100, 125-128, 147-150, 188-191, 217- 

220, 265-268, 289-292, 305-308, 320-323, 326-329, 362-365, 368- 
341, 369-372, 382-385, 386-389, 387-390 

N-myr i s toyl a tion si tes : 

amino acids 143-148, 239-244 
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FIGURE 151 



CGGCTCGAGGCTCCCGCCAGGAGAAAGGAACATTCTGAGGGGAGTCTACACCCTGTGGAGCTCAA 
GATC5GTCCTGAGTGGGGCGCTGTGCTTCCGAATGAAGGACTCGGCATTGAAGGTGCTTTATCTGC 

ata^[taaccagcttctagctggagggctgcatgcagggaaggtcattaaaggtgaagagatcagc 
gtggtccccaatcggtggctggatgccagcctgtcccccgtcatcctgggtgtccagggtggaag 
ccagtgcctgtcatgtggggtggggcaggagccgactctaacactagagccagtgaacatcatgg 
agctctatcttggtgccaaggaatccaagagcttcaccttctaccggcgggacatggggctcacc 
tccagcttcgagtcggctgcctacccgggctggttcctgtgcacggtgcctgaagccgatcagcc 
tgtc^gactcacccagcttcccgagaatggtggctggaatgcccccatcacagacttctacttcc 
agcagtgtgactagggcaacgtgccccccagaactccctgggcagagccagctcgggtgaggggt 
gagtggaggagacccatggcggacaatcactctctctgctctcaggacccccacgtctgacttag 
tgggcacctgaccactttgtcttctggttcccagtttggataaattctgagatttggagctcagt 
ccacggtcctcccccactggatggtgctactgctgtggaaccttgtaaaaaccatgtggggtaaa 
ctgggaataacatgaaaagatttctgtgggggtggggtgggggagtggtgggaatcattcctgct 
taatggtaactgacaagtgttaccctgagccccgcaggccaacccatccccagttgagccttata 
gggtcagtagc tctccacatgaagtc ct g tcac tcac cact g t gcaggagagggagg t ggtcata 
gagtcagggatctatggcccttggcccagccccacccccttccctttaatcctgccactgtcata 
tgctacctttcctatctcttccctcatcatcttgttgtgggcatgaggaggtggtgatgtcagaa 
gaaatggct cgagc t cagaagataaaagataag tagggtatgctgatcctc tt t taaaaacccaa 

GAT ACAATCAAAAT CCCAGATGC T GGTCT CTAT TCCCAT GAAAAAG TGCTCATGACATAT TGAGA 
AGACCTACTTACAAAGTGGCATATATTGCAATT TATTTTAAT TAAAAGATACCTATT TATATATT 
TCTTTATAGAAAAAAGTCTGGAAGAGTTTACTTCAATTGTAGCAATGTCAGGGTGGTGGCAGTAT 
AGGTGATT TT TC TT TTAAT T C T GT TAAT T TATC TGTATT TCCTAAT T T T TC TACAAT GAAGATGA 
AT TCCTT GTATAAAAATAAGAAAAGAAAT TAATCT TGAGGT AAGCAGAGCAGACATCATC TCTGA 
TTGTCCTCAGCCTCCACTTCCCCAGAGTAAATTCAAATTGAATCGAGCTCTGCTGCTCTGGTTGG 
T TGTAGT AGTGAT CAGGAAACAGATC TCAGCAAAGCCACTGAGGAGGAGGCTGT GCTGAGT T TGT 
GTGGCTGGAATCTCTGGGTAAGGAACTTAAAGAACAAAAATCATCTGGTAATTCTTTCCTAGAAG 
GAT CACAGCCCC TGGGAT TCCAAGGC AT TGGATCCAGTC TCTAAGAAGGC TGCT G T AC TGG T TGA 
ATTGTGTCCCCCTCAAATTCACATCCTTCTTGGAATCTCAGTCTGTGAGTTTATTTGGAGATAAG 
GTCTCTGCAGATGTAGT TAGTTAAGACAAGGTCATGCTGGATGAAGGTAGACCTAAAT TCAATAT 
GACTGGT T T CCT TG TATGAAAAGGAGAGGACACAGAGACAGAGGAGAC GCGGGGAAGACT ATGTA 
AAGAT GAAGGCAGAGATCGGAGTTTTGCAGCCACAAGCTAAGAAACACCAAGGAT TGT GGCAACC 
ATCAGAAGCTTGGAAGAGGCAAAGAAGAAT TCT TCCCTAGAGGCTT TAGAGGGATAACGGCTC TG 
CTGAAACCTTAATC T CAGACT T CCAGCCTCCT GAACGAAGAAAGAATAAAT T TCGGCT G T T T TAA 
GCCACCAAGGATAAT TGG T T ACAGCAGCT CTAGGAAACTAATACAGCT GC TAAAATGATCCCTGT 
CTCCTCGTGTTTACATTCTGTGTGTGTCCCCTCCCACAATGTACCAAAGTTGTCTTTGTGACCAA 
TAGAATATGGCAGAAGTGAT GGCATGCCAC T TC CAAGAT TAGGT TATAAAAGACACTGCAGCT TC 
TACTTGAGCCCTCTCTCTCTGCCACCCACCGCCCCCAATCTATCTTGGCTCACTCGCTCTGGGGG 
AAGCTAGCTGCCATGCTATGAGCAGGCCTATAAAGAGACTTACGTGGTAAAAAATGAAGTCTCCT 
GCCCACAGC CACAT TAGTGAACCTAGAAGCAGAGAC T CT GT GAGATAATCGATGTT TGTT GT TT T 
AAGTTGCTCAGTTTTGGTCTAACTTGTTATGCAGCAATAGATAAATAATATGCAGAGAAAGAG 
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FIGURE 152 



MVLSGALCFRMKDSALKVLYLHNNQLLAGGLHAGKVIKGEE I SWPNRWLDASLSPVI LGVQGGS 
QCLSCGVGQEPTLTLEPVNIMELYLGAKESKS FT FYRRDMGLTSS FESA&YPGWFLCTVPEADQP 
VRLTQLPENGGWNAPITDFYFQQCD 

N-myristoylation sites. 

amino acids 29-34, 30-35, 60-65, 63-68, 73-78, 91-96, 106-111 




Interleukin-1 signature. 



amino acids 111-131 



Interleukin-1 proteins . 

amino acids 8-29, 83-120, 95-134, 64-103 



IE 
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FIGURE 153 

CTTCAGAACAGGTTCTCCTTCCCCAGTCACCAGTTGCTCGAGTTAGAA.TTGTCTGCAATGGCCGC 
CCTGCAGAAATCTGTGAGCTCTTTCCTTATGGGGACCCTGGCCACCAGCTGCCTCCTTCTCTTGG 
CCCTCTTGGTACAGGGAGGAGCAGCTGCGCCCATCAGCTCCCACTGCAGGCTTGACAAGTCCAAC 
TTCCAGCAGCCCTATATCACCAACCGCACCTTCATGCTGGCTAAGGAGGCTAGCTTGGCTGATAA 
CAACACAGACGTTCGTCTCATTGGGGAGAAACTGTTCCACGGAGTCAGTATGAGTGAGCGCTGCT 
ATCTGATGAAGCAGGTGCTGAACTTCACCCTTGAAGAAGTGCTGTTCCCTCAATCTGATAGGTTC 
CAGCCTTATATGCAGGAGGTGGTGCCCTTCCTGGCCAGGCTCAGCAACAGGCTAAGCACATGTCA 
TATTGAAGGTGATGACCTGCATATCCAGAGGAATGTGCAAAAGCTGAAGGACACAGTGAAAAAGC 
TTGGAGAGAGTGGAGAGATCAAAGCAAT TGGAGAACTGGAT TT GCTGTT TATGT CT CT GAGAAAT 
GCCTGCAT TTGACCAGAGCAAAGCT GAAAAAT GAATAACTAACCCCCTTTCCCT GCTAGAAATAA 
CAAT T AGAT GCCCCAAAGCGATT TT TT TTAACCAAAAGGAAGATGGGAAGCCAAACT CCATCAT G 
ATGGGTGGATTCCAAATGAACCCCTGCGTTAGTTACAAAGGAAACCAATGCCACTTTTGTTTATA 
AGACCAGAAGGTAGACTTTCTAAGCATAGATATTTATTGATAACATTTCATTGTAACTGGTGTTC 
TATACACAGAAAACAAT TTAT T TT T T AAATAAT TG TCTTT T TC CATAAAAAAGAT T ACT TTCCAT 
TCCTTTAGGGGAAAAAACCCCTAAATAGCTTCATGTTTCCATAATCAGTACTTTATATTTATAAA 
TGTAT TTAT TAT TAT TATAAGACTGCATT TTATTTATAT CAT TT TAT TAATATGGAT TTAT T TAT 
AGA&ACATCAT T CGATAT TGCT ACT T GAG TGTAAGGCTAATAT TGATAT T TATGACAATAAT TAT 
AGAGCTATAACATG T T TATT TGAC CT CAATAAACACT TGGATATCCC 



FIGURE 154 

MAALQKS VS S FLMGTLATSCLLLLALLVQGGAAAP I SSHCRLDKSNFQQP Y I TNRT FMLAKEAS L 
ADNISTTDWLIGEKLFHGVSMSERCYI^QVLNFTLEEVLFPQSDRFQPYMQEWPFLARLSNRLS 
TCHIEGDDLHIQRNVQKLKDTVKKLGESGEIKAIGELDLLFMSLRNACI 

Important features of the protein: 
Signal peptide: 

amino acids 1-33 

N-glycosylation sites. 

amino acids 54-58, 68-72, 97-101 

N-myr i s toyl a ti on s i te s . 

amino acids 14-20, 82-88 

Prokaryotic membrane lipoprotein lipid attachment site. 

amino acids 10-21 
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FIGURE 155 

GGCTTGCTGAAAATAAAATCAGGACTCCTAACCTGCTCCAGTCAGCCTGCTTCCACGAGGCCTGT 
CAGT CAGTGCCCGACT TGTGAC T GAGT GTGCAGTGCCCAGCAT GTACCAGGTCAGT GCAGAGGGC 
TGCCTGAGGGCTGTGCTGAGAGGGAGAGGAGCAGAGATGCTGCTGAGGGTGGAGGGAGGCCAAGC 
TGCCAGGTTTGGGGCTGGGGGCCAAGTGGAGTGAGAAACTGGGATCCCAGGGGGAGGGTGCAGAT 
GAGGGAGCGACCCAGAT TAGG T GAGGACAGT TC TC T CAT T AGCCT T TTCCTACAGGTGGT TGCAT 
TCTTGGCAATGGTCATGGGAACCCACACCTACAGCCACTGGCCCAGCTGCTGCCCCAGCAAAGGG 
CAGGACACCTCTGAGGAGCTGCTGAGGTGGAGCACTGTGCCTGTGCCTCCCCTAGAGCCTGCTAG 
GCC CAAC CGCCACCCAGAGT CCTGTAGGGCCAG TGAAGATGGACCC CTCAACAGCAGGGC CATCT 
CCCCCTGGAGATATGAGTTGGACAGAGACTTGAACCGGCTCCCCCAGGACCTGTACCACGCCCGT 
TGCCTGTGCCCGCACTGCGTCAGCCTACAGACAGGCTCCCACATGGACCCCCGGGGCAACTCGGA 
GCTGCTCTACCACAACCAGACTGTCTTCTACAGGCGGCCATGCCATGGCGAGAAGGGCACCCACA 
AGGGCTACTGCCTGGAGCGCAGGCTGTACCGTGTTTCCTTAGCTTGTGTGTGTGTGCGGCCCCGT 
GTGATGGGCTAGCCGGACCTGCTGGAGGCTGGTCCCTTTTTGGGAAACCTGGAGCCAGGTGTACA 
ACCACT TGCCATGAAGGGCCAGGATGCCCAGATGCT T GGCCCCTGTGAAGTGCT GT CT GGAGCAG 
CAGGATCCCGGGACAGGATGGGGGGCTTTGGGGAAAACCTGCACTTCTGCACATTTTGAAAA.GAG 
CAGCTGCTGCTTAGGGCCGCCGGAAGCTGGTGTCCTGTCATTTTCTCTCAGGAAAGGTTTTCAAA 
GTTCTGCCCATTTCTGGAGGCCACCACTCCTGTCTCTTCCTCTTTTCCCATCCCCTGCTACCCTG 
GCCiCAGCACAGGCACTTTCTAGATATTTCCCCCTTGCTGGAGAAGAAAGAGCCCCTGGTTTTATT 
TGTTTGTTTACTCATCACTCAGTGAGCATCTACTTTGGGTGCATTCTAGTGTAGTTACTAGTCTT 
TTGACATGGAT GAT TC TGAGGAGGAAGCTGTTAT TGAAT G TATAGAGAT TTATCCAAATAAATAT 
CTTTATTTAAAAATGAAAAA 
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FIGURE 156 

MRERPRLGEDSSLISLFLQWAFLAfclVMGTHTYSHWPSCCPSKGQDTSEELLRWSTVPVPPLEPA 
RPNFIHPESCRASEDGPLNSRAISPWRYELDRDLNRLPQDLYHARCLCPHCVSLQTGSHMDPRGNS 
ELLYHNQTVFYRRPCHGEKGTHKGYCLERRLYRVSLACVCVRPRVMG 

Important features of* the protein: 
Signal peptide: 

amino acids 1-32 



N-glycosylation site. 



y 

III amino acids 136-14 0 

w 



Tyrosine kinase phosphorylation site, 



t s amino acids 127-135 

N-myristoylation sites. 

^5. amino acids 44-50, 150-156 
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FIGURE 157 

CCGGCGATGTCGCT CG TGCT GCTAAGCCTGGCCGCGCT GTGCAGGAGCGC CGTACC CCGAGAGCC 
GACCGT T CAATGT GGC TC TGAAACTGGGCCATCT CCAGAGT GGAT GCTACAACATGATCT AATCC 
CCGGAGACTTGAGGGACCTCCGAGTAGAACCTGTTACAACTAGTGTTGCAACAGGGGACTATTCA 
ATTTTGATGAATGTAAGCTGGGTACTCCGGGCAGATGCCAGCATCCGCTTGTTGAAGGCCACCAA 

GAT T T GT GT GACGGGCAAAAGCAACTTCCAGTCC TACAGCTG T GTGAGGT GCAAT T ACACAGAGG 
CCTTCCAGACTCAGACCAGACCCTCTGGTGGTAAATGGACATTTTCCTACATCGGCTTCCCTGTA 

GAGC T GAAC ACAGT C TATT T CAT T GGGGCCC ATAAT ATT CCTAAT GCAAATATGAAT GAAGAT GG 
CCCTTCCATGTCTGTGAATTTCACCTCACCAGGCTGCCTAGACCACATAATGAAATATAAAAAAA 
AGT GTG T CAAGGCCGGAAGCCT GT GGGATCCGAACAT CACTGCT TGTAAGAAGAAT GAGGAGACA 
G TAGAAGTGAACTT CACAACCACT CCCC TGGGAAACAGATACATGGCTC T TATCCAACACAGCAC 
TAT CATCGGG TTTTC TCAGGT GTT TGAGCCACACCAGAAGAAACAAACGCGAGC TT CAGT GGTGA 
TTCCAGTGACTGGGGATAGTGAAGGTGCTACGGTGCAGCTGACTCCATATTTTCCTACTTGTGGC 
AGCGACTGCATCCGACATAAAGGAACAGTTGTGCTCTGCCCACAAACAGGCGTCCCTTTCCCTCT 
GGATAACAACAAAAGCAAGCCGGGAGGCTGGCTGCCTCTCCTCCTGCTGTCTCTGCTGGTGGCCA 

CAT GGG T GCT GGTGGCAGGGAT CT AT CTAATGTGGAGGCAC GAAAGGAT CAAGAAGACTT CCT TT 
TCTACCACCACACTACTGCCCCCCATTAAGGTTCTTGTGGTTTACCCATCTGAAATATGTTTCCA 

TCACACAAT T TG T TAC TT CACT GAAT TT C TTCAAAACCAT T GCAGAAGTGAGG T CAT CCT TGAAA 
AGTGGCAGAAAAAGAAAATAGCAGAGATGGGTCCAGTGCAGTGGCTTGCCACTCAAAAGAAGGCA 
GCAGACAAAGTCGTCTTCCTTCTTTCCAATGACGTCAACAGTGTGTGCGATGGTACCTGTGGCAA 
GAGCGAGGGCAGTCCCAGTGAGAACTCTCAAGACCTCTTCCCCCTTGCCTTTAACCTTTTCTGCA 

G TGATCT AAGAAGCCAGAT T CAT CT GCAC AAAT AC GT GGTGG T C TAC T T T AGAGAGAT T GAT ACA 
AAAGACGATTACAATGCTCTCAGTGTCTGCCCCAAGTACCACCTCATGAAGGATGCCACTGCTTT 
CTGTGCAGAACTTCTCCATGTCAAGCAGCAGGTGTCAGCAGGAAAAAGATCACAAGCCTGCCACG 
ATGGCTGCTGCTCCTTGTAG 
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FIGURE 158 



MSLVLLSLAALCRSAVPREPTVQCGSETGPSPEWMLQHDLIPGDLRDLRVEPVTTSVATGDYSILMNVSWV 
LRADAS I RLLKATK I C VT GKSN FQ S Y SC VRCNYT E AFQTQTRPS GG KWT F S Y I G F P VELNT V Y F IG AHN I P 
NAmNE DG PSMS VNFT S PGCLDH IMKYKKKCV^ 

STIIGFSQVFEPHQKKQTRASWIPVTGDSEGATVQLTPYFPTCGSDCIRHKGTWLCPQTGVPFPLDNNK 
SKPGGWLPLLLLSLLVATWVLVAGIYLMWRHERIKKTSFSTTTLLPPIKVLVVYPSEICFHHTICYFTEFL 
QNHCRSEVILEKWQKKKIAEMGPVQWLATQKKAADKWFLLSNDVNSVCDGTCGKSEGSPSENSQDLFPLA 
FNLFCSDLRSQIHLHKYVWYFREIDTKDDYNALSVCPKYHLMKDATAFCAELLHVKQQVSAGKRSQACHD 

GCCSL 



ij Important features of the protein: 

ys Signal peptide: 

^ amino acids 1-14 

%? £ 
1 1| 

J[j Transmembrane domain: 

amino acids 290-309 

y ■ 

HI N-glycosylation sites. 

Q amino acids 67 - 71, 103 - 107, 156 - 160, 183 - 187, 197 - 201 and 283 



- 287 



CAM]?- and cGMP-dependent protein kinase phosphorylation sites. 

amino acids 228 - 232 and 319 - 323 

Casein kinase II phosphorylation sites. 

amino acids 178 - 182, 402 - 406, 414 - 418 and 453 - 457 

N-myristoylation site. 

amino acids 116-122 

Amidation site. 

amino acids 488-452 
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FIGURE 15Q 



AGCCACCAGCGCAACAT^CAGTGAA.GACCCTGCATGGCCCAGCCATGGTCAAGTACTTGC'rGCT 
GTCGATATTGGGGCTTGCCTTTCTGAGTGAGGCGGCAGCTCGGAAAATCCCCAAAGTAGGACATA 
CTTTTTTCCAAAAGCCTGAGAGTTGCCCGCCTGTGCCAGGAGGTAGTATGAAGCTTGACATTGGC 
ATCATCAATGAAAACCAGCGCGTTTCCATGTCACGTAACATCGAGAGCCGCTCCACCTCCCCCTG 
GAATTACACTGTCACTTGGGACCCCAACCGGTACCCCTCGGAAGTTGTACAGGCCCAGTGTAGGA 
ACTTGGGCTGCATCAATGCTCAAGGAAAGGAAGACATCTCCATGAATTCCGTTCCCATCCAGCAA 
GAGACCCTGGTCGTCCGGAGGAAGCACCAAGGCTGCTCTGTTTCTTTCCAGTTGGAGAAGGTGCT 
GGTCjACTGTTGGCTGCACCTGCGTCACCCCTGTCATCCACCATGTGCAGTAAGAGGTGCATATCC 

ACTCAGCTGAAGAAG 
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FIGURE 160 



MTVKTLHGPAMVKYLLLSILGLAFLSEAAARKIPKVGHTFFQKPESCPPVPGGSMKLDIGIINEN 
QRVSMSRNIESRS TS PWNYTVTWDPNRYPSEWQAQCRNLGCINAQGKED I SMMS VPIQQETLW 
RRKHQGCSVSFQLEKVLVTVGCTCVTPVIHHVQ 

Signal sequence: 

amino acids 1-30 

N-glycosylation site. 

amino acids 83-87 

N-myristoylation sites. 

amino acids 106-111, 136-141 
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FIGURE 161 



ACACTGGCCAAACAAAAACGAAAGCACTCCGTGCTGGAAGTAGGAGGAGAGTCAGGACTCCCAGG 
ACAGAGAGTGCACAAACTACCCAGCACAGCCCCCTCCGCCCCCTCTGGAGGCTGAAGAGGGATTC 
CAGCCCCTGCCACCCACAGACACGGGCTGACTGGGGTGTCTGCCCCCCTTGGGGGGGGGCAGCAC 
AGGGCCTCAGGCCTGGGTGCCACCTGGCACCTAGAAGATGCCTGTGCCCTGGTTCTTGCTGTCCT 
TGGGACTGGGCCGAAGCCCAGTGGTCCTTTCTCTGGAGAGGCTTGTGGGGCCTCAGGACGCTACC 
CACTGCTCTCCGGGCCTCTCCTGCCGCCTCTGGGACAGTGACATACTCTGCCTGCCTGGGGACAT 
CGTGCCTGCTCCGGGCCCCGTGCTGGCGCCTACGCACCTGCAGACAGAGCTGGTGCTGAGGTGCC 
AGAi^GGAGACCGACTGTGACCTCTGTCTGCGTGTGGCTGTCCACTTGGCCGTGCATGGGCACTGG 
GAAGAGCC TGAAGATGAGGAAAAGT T TGGAGGAGCAGCT GACT CAGGGGTGGAGGAGCCT AGGAA 
TGCCTCTCTCCAGGCCCAAGTCGTGCTCTCCTTCCAGGCCTACCCTACTGCCCGCTGCGTCCTGC 
Uh TGGAGGTGCAAGTGCCTGCTGCCCTTGTGCAGTTTGGTCAGTCTGTGGGCTCTGTGGTATATGAC 
f-| T GC T T CGAGGCTGC CCTAGGGAGT GAGGTACGAATC T GGTCCTATAC TCAGCCCAGGT AC GAGAA 

jSj GGAACTCAACC^CACACAGCAGCTGCCTGCCCTGCCCTGGCTC 

ACGTGCATCTGGTTCTGAATGTCTCTGAGGAGCAGCACTTCGGCCTCTCCCTGTACTGGAATCAG 
GTCCAGGGCCCCCCAAAACCCCGGTGGCACAAAAACCTGACTGGACCGCAGATCATTACCTTGAA 
CCACACAGACCTGGTTCCCTGCCTCTGTATTCAGGTGTGGCCTCTGGAACCTGACTCCGTTAGGA 
CGA^CATCTGCCCCTTCAGGGAGGACCCCCGCGCACACCAGAACCTCTGGCAAGCCGCCCGACTG 
CGACTGCTGACCCTGCAGAGCTGGCTGCTGGACGCACCGTGCTCGCTGCCCGCAGAAGCGGCACT 
GTGCTGGCGGGCTCCGGGTGGGGACCCCTGCCAGCCACTGGTCCCACCGCTTTCCTGGGAGAACG 
W TCACTGTGGACAAGGTTCTCGAGTTCCCATTGCTGAAAGGCCACCCTAACCTCTGTGTTCAGGTG 

AACAGCTCGGAGAAGCTGCAGCTGCAGGAGTGCTTGTGGGCTGACTCCCTGGGGCCTCTCAAAGA 
CGATGTGCTACTGTTGGAGACACGAGGCCCCCAGGACAACAGATCCCTCTGTGCCTTGGAACCCA 
GTGGCTGTACTTCACTACCCAGCAAAGCCTCCACGAGGGCAGCTCGCCTTGGAGAGTACTTACTA 
JJ{ CAAGACCTGCAGTCAGGCCAGTGTCTGCAGCTATGGGACGATGACTTGGGAGCGCTATGGGCCTG 
y CCCCATGGACAAATACATCCACAAGCGCTGGGCCCTCGTGTGGCTGGCCTGCCTACTCTTTGCCG 
f|| CTGCGCTTTCCCTCATCCTCCTTCTCAAAAAGGATCACGCGAAAGGGTGGCTGAGGCTCTTGAAA 
PI CAGGACGTCCGCTCGGGGGCGGCCGCCAGGGGCCGCGCGGCTCTGCTCCTCTACTCAGCCGATGA 

CTCGGGTTTCGAGCGCCTGGTGGGCGCCCTGGCGTCGGCCCTGTGCCAGCTGCCGCTGCGCGTGG 



1.K ; 



I'll 



CCGTAGACCTGTGGAGCCGTCGTGAACTGAGCGCGCAGGGGCCCGTGGCTTGGTTTCACGCGCAG 
CGGCGCCAGACCCTGCAGGAGGGCGGCGTGGTGGTCTTGCTCTTCTCTCCCGGTGCGGTGGCGCT 
GTGCAGCGAGTGGCTACAGGATGGGGTGTCCGGGCCCGGGGCGCACGGCCCGCACGACGCCTTCC 
GCGCCTCGCTCAGCTGCGTGCTGCCCGACTTCTTGCAGGGCCGGGCGCCCGGCAGCTACGTGGGG 
GCCTGCTTCGACAGGCTGCTCCACCCGGACGCCGTACCCGCCCTTTTCCGCACCGTGCCCGTCTT 
CACACTGCCCTCCCAACTGCCAGACTTCCTGGGGGCCCTGCAGCAGCCTCGCGCCCCGCGTTCCG 
GGCGGCTCCAAGAGAGAGCGGAGCAAGTGTCCCGGGCCCTTCAGCCAGCCCTGGATAGCTACTTC 
CATCCCCCGGGGACTCCCGCGCCGGGACGCGGGGTGGGACCAGGGGCGGGACCTGGGGCGGGGGA 
C GGGACT TAAATAAAGGCAGAC GCTGTT T T TCTAAAAAAA 
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FIGURE 162 



MPVPWFLLSLALGRSPWLSLERLVGPQDATHCSPGLSCRLWDSDILCLPGDIVPAPGPVLAPTHLQTELV 
LRCQKETDCDLCLRVAVHIAVHGHWEEPEDEEKFGGAADSGVEEPRNASLQAQVVLSFQAYPTARCVLLEV 
QVPAALVQ FGQSVGS WYDC FEAALGSE VR I WSYTQ PRYEKELNHTQQLPALPWLNVSADGDNVHLVLNVS 
EEQHFGLSLYWNQVQGPPKPRWHKNLTGPQIITLNHTDLVPCLCIQVWPLEPDSVRTNICPFREDPRAHQN 
LWQAARLPXLTLQSWLLDAPCSLPAEAALCWRAPGGDPCQPLVPPLSWENVTVDKVLEFPLLKGHPNLCVQ 
VN S S E KLQLQEC LW AD SLGP LKDD VLLL ET RGPQ DN RS LC AL E P SGCT SL PS KAST RAARLGE Y LLQD LQ S 
GQCLQLWDDDLGALWACPMDKYIHKRWALVWLACLLFAAALSLILLLKE<DHAKGWLRLLKQDVRSGAAARG 
RAAIiLL YSADDSGFERLVGALASALCQLPLRVAVDLWSRRELSAQG PVAWFHAQRRQTLQEGGVWLL FSP 
GAVALCSEWLQDGVSGPGAHGPHDAFRASLSCVLPDFLQGRAPGSYVGACFDRLLHPDAVPALFRTVPVFT 
LPSQLPDFLGALQQPRAPRSGRLQERAEQVSRALQPALDSYFHPPGTPAPGRGVGPGAGPGAGDGT 

Signal sequence: 

amino acids 1-20 

Transmembrane domain. 

amino acids 453-475 

N-glycosylation sites . 

amino acids 118-121, 186-189, 198-201, 211-214, 238-241, 248-251, 
334-337, 357-360, 391-394 

Glycosaminoglycan attachment site. 

amino acids 583-58 6 

cAM?- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 552-555 

N-myristoylation sites. 

amino acids 107-112, 152-157, 319-324, 438-443, 516-521, 612-617, 
692-697, 696-701, 700-705 
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FIGURE 163 



GGGAGGGCI CTGTGCCAGCCCCGATGAGGACGC TGCT GACCAT CT TGACTGTGGGATCCCTGGCT 

GCTCACGCCCCTGAGGACCCCTCGGATCTGCTCCAGCACGTGAAATTCCAGTCCAGCAACTTTGA 

AAACATCCTGACGTGGGACAGCGGGCCAGAGGGCACCCCAGACACGGTCTACAGCATCGAGTATA 

AGACGTACGGAGAGAGGGACTGGGTGGCAAAGAAGGGCTGTCAGCGGATCACCCGGAAGTCCTGC 

AACCTGACGGTGGAGACGGGCAACCTCACGGAGCTCTACTATGCCAGGGTCACCGCT 

GTCAGTGCGGGAGGCCGGTCAGCCACCAAGATGACTGACAGGTTCAGCTCTCTGCAGCACACTAC 

CCTGAAGCCACCTGATGTGACCTGTATCTCCAAAGTGAGATCGATTCAGATGATTGTTCATCCTA 

CCCCCACGCCAATCCGTGCAGGCGATGGCCACCGGCTAACCCTGGAAGACATCTTCCATGACCTG 

TT C TACCAC T TAGAGC TCCAGGTCAACCGCACC TACCAAATGCACC T TGGAGGGAAGCAGAGAGA 

ATATGAGTTCTTCGGCCTGACCCCTGACACAGAGTTCCTTGGCACCATCATGATTTGCGTTCCCA 

CCTGGGCCAAGGAGAGTGCCCCCTACATGTGCCGAGTGAAGACACTGCCAGACCGGACATGGACC 

TACTCCTTCTCCGGAGCCTTCCTGTTCTCCATGGGCTTCCTCGTCGCAGTACTCTGCTACCTGAG 

CTACAGATATGTCACCAAGCCGCCTGCACCTCCCAACTCCCTGAACGTCCAGCGAGTCCTGACTT 

TCCAGCCGCTGCGCTTCATCCAGGAGCACGTCCTGATCCCTGTCTTTGACCTCAGCGGCCCCAGC 

AGTCTGGCCCAGCCTGTCCAGTACTCCCAGATCAGGGTGTCTGGACCCAGGGAGCCCGCAGGAGC 

TCCACAGCGGCATAGCCTGTCCGAGATCACC TACT TAGGGCAGCCAGACATCTCCATCCT CCAGC 

CCTCCAACGTGCCACCTCCCCAGATCCTCTCCCCACTGTCCTATGCCCCAAACGCTGCCCCTGAG 

GTCGGGCCCCCATCCTATGCACCTCAGGTGACCCCCGAAGCTCAATTCCCATTCTACGCCCCACA 

GGC^TCTCTAAGGTCCAGCCTTCCTCCTATGCCCCTCAAGCCACTCCGGACAGCTGGCCTCCCT 

CCTATGGGGTATGCATGGAAGGTTCTGGCAAAGACTCCCCCACTGGGACACTTTCTAGTCCTAAA 

CAC C T TAGGCC TAAAGGT CAGC TT CAGAAAGAGCCAC CAGCTGGAAGCTGCATGT TAGGT GGCC T 

TTCTCTGCAGGAGGTGACCTCCTTGGCTATGGAGGAATCCCAAGAAGCAAAATCATTGCACCAGC 

CCCTGGGGAT T TGCACAGACAGAACATC TGACCCAAATGTGCTACACAGTGGGGAGGAAGGGACA 

CCACAGTACCTAAAGGGCCAGCTCCCCCTCCTCTCCTCAGTCCAGATCGAGGGCCACCCCATGTC 

CCTCCCTTTGCAACCTCCTTCCGGTCCATGTTCCCCCTCGGACCAAGGTCCAAGTCCCTGGGGCC 

TGCTGGAGTCCCTTGTGTGTCCCAAGGATGAAGCCAAGAGCCCAGCCCCTGAGACCTCAGACCTG 

GAGCAGCCCACAGAACTGGATTCTCTTTTCAGAGGCCTGGCCCTGACTGTGCAGTGGGAGTCCTG 

AGGGGAATGGGAAAGGCTTGGTGCTTCCTCCCTGTCCCTACCCAGTGTCACATCCTTGGCTGTCA 

ATCCCATGCCTGCCCATGCCACACACTCTGCGATCTGGCCTCAGACGGGTGCCCTTGAGAGAAGC 

AGAGGGAGTGGGATGCAGGGCCCCTGCCATGGGTGCGCTCCTCACCGGAACAAAGCAGCATGATA 

AGGACTGCAGCGGGGGAGCTCTGGGGAGCAGCTTGTGTAGACAAGCGCGTGCTCGCTGAGCCCTG 

CAAGGCAGAAATGACAGTGCAAGGAGGAAATGCAGGGAAAC TCCCGAGGT CCAGAGCC CCACCT C 

CTA^CACCATGGATTCAAAGTGCTCAGGGAATTTGCCTCTCCTTGCCCCATTCCTGGCCAGTTTC 

ACAA.TCTAGCTCGACAGAGCATGAGGCCCCTGCCTCTTCTGTCATTGTTCAAAGGTGGGAAGAGA 

GCCTGGAAAAGAACCAGGCCTGGAAAAGAACCAGAAGGAGGCTGGGCAGAACCAGAACAACCTGC 

ACTTCTGCCAAGGCCAGGGCCAGCAGGACGGCAGGACTCTAGGGAGGGGTGTGGCCTGCAGCTCA 

TTCCCAGCCAGGGCAACTGCCTGACGTTGCACGATTTCAGCTTCATTCCTCTGATAGAACAAAGC 

GAA^lTGCAGGTCCACCAGGGAGGGAGACACACAAGCCTTTTCTGCAGGCAGGAGTTTCAGACCCT 

ATCCTGAGAATGGGGTTTGAAAGGAAGGTGAGGGCTGTGGCCCCTGGACGGGTACAATAACACAC 

TGTACTGATGTCACAACTTTGCAAGCTCTGCCTTGGGTTCAGCCCATCTGGGCTCAAATTCCAGC 

C TCAC CACTCACAAGCTG T GT GACT T CAAAC AAATGAAATCAGT GCCCAGAACCTCGGT TT CC T C 

ATCTGTAATGTGGGGATCATAACACCTACCTCATGGAGTTGTGGTGAAGATGAAATGAAGTCATG 

TCTTTAAAGTGCTTAATAGTGCCTGGTACATGGGCAGTGCCCAATAAACGGTAGCTATTTAAAAA 

AAAAAAAA 
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FIGURE 16 J. 
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MRTLLTILTVGSLAAHAPEDPSDLLQHVKFQSSNFENILTWDSGPEGTPDTVYSIEYKTYGERDW 
VftKKGCQRI TRKS CNLTVE T GNLTEL YYARVTAVSAGGRS ATKMTDRFS SLQHT TLKPPDVTC I S 
KVRSIQMIVHPTPTPIRAGDGHRLTLEDIFHDLFYHLELQVNRTYQMHLGGKQREYEFFGLTPDT 
EFLGT IMICVPTWAKESAPYMCRVKTLPDRTWTYS FSGAFLFSMGFLVAVLCYLSYRYVTKPPAP 
PNSLNVQRVLT FQPLRFI QEHVLI PVFDL SGPS S LAQPVQYSQIRVSGPRE PAGAPQRHS LSE IT 
YLGQPDISILQPSNVPPPQILSPLSYAPNAAPEVGPPSYAPQVTPEAQFPFYAPQAISKVQPSSY 
APQATPDSWPPSYGVCMEGSGKDSPTGTLSSPKHLRPKGQLQKEPPAGSCMLGGLSLQEVTSLAM 
EESQEAKSLHQPLGICTDRTSDPNVLHSGEEGTPQYLKGQLPLLSSVQIEGHPMSLPLQPPSGPC 
SPSDQGPSPWGLLESLVCPKDEAKSPAPETSDLEQPTELDSLFRGLALTVQWES 

Signal sequence. 

amino acids 1-17 



Transmembrane domain. 

Q amino acids 233-250 



N-gly cosy 1 a t i on s i tes . 

amino acids 80-83, 87-90, 172-175 

N-myristoylation sites. 

amino acids 11-16, 47-52, 102-107, 531-536, 565-570 
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FIGURE 165 



TGGCCTACTGGAAAAAAAAAAAAAAAAAAAAAAAGTCACCCGGGCCCGCGGTGGCCACAAC ATG G 
CTGCGGCGCCGGGGCTGCTCTTCTGGCTGTTCGTGCTGGGGGCGCTCTGGTGGGTCCCGGGCCAG 
TCGGATCTCAGCCACGGACGGCGTTTCTCGGACCTCAAAGTGTGCGGGGACGAAGAGTGCAGCAT 
GTTAATGTACCGTGGGAAAGCTCTTGAAGACTTCACGGGCCCTGATTGTCGTTTTGTGAATTTTA 
AAAi^AGGTGACGATGTATATGTCTACTACAAACTGGCAGGGGGATCCCTTGAACTTTGGGCTGGA 
AGT GT TGAACACAGTT T T GGAT AT T T TCCAAAAGATT TGAT CAAGGTACTTCAT AAATACACGGA 
AGAAGAGC TACATATTCCAGGAGATGAGACAGACT T T GT C T GC TT TGAAGGAGGAAGAGATGAT T 
T TAATAGT TATAATGTAGAAGAGCT T TTAGGATCTTT GGAACTGGAGGACTCTGTACCTGAAGAG 
T CGAAGAAAGCTGAAGAAG T T T CTCAGCACAGAGAGAAATCTC C TGAGGAGTCT CGGGGGCGTGA 
ACTTGACCCTGTGCCTGAGCCCGAGGCATTCAGAGCTGATTCAGAGGATGGAGAAGGTGCTTTCT 
CAGAGAGCACCGAGGGGC TGCAGGGACAGCCC TCAGCTCAGGAGAGCC AC CCT CACACCAGCGGT 
CCTGCGGCTAACGCTCAGGGAGTGCAGTCTTCGTTGGACACTTTTGAAGAAATTCTGCACGATAA 
ATTGAAAGTGCCGGGAAGCGAAAGCAGAACTGGCAATAGTTCTCCTGCCTCGGTGGAGCGGGAGA 
AGACAGATGC T TACAAAGT CCT GAAAACAGAAATGAGTC AGAGAGGAAGT GGAC AGTGCGTTAT T 
CAT TACAGCAAAGGAT T TCGTT GGCATCAAAATCTAAGT TT GT T TTACAAAGAT TGT T T T TAG TA 
C TAAGCT GCCT TGGCAGT T T GCATT T T T GAGCCAAACAAAAAT ATAT TATT TT CCCT T CTAAGTA 
AAA&AAAAAAAAAAAAAAAA 
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FIGURE 166 



MAA^^PGLLFWLFVLGALWWPGQSDLSHGRRFSDLKVCGDEECSMLMYRGKALEDFTGPDCRFVN 
FKKGDDVYVYYKLAGGSLELWAGSVEHS FGYFPKDLIKVLHKYTEEELHI PADETDFVCFEGGRD 
DFNSYNVEELLGSLELEDSVPEESKKAEEVSQHREKSPEESRGRELDPVPEPEAFRADSEDGEGA 
FSESTEGLQGQPSAQESHPHTSGPAANAQGVQSSLDTFEEILHDKLKVPGSESRTGNSSPASVER 
EKT DAYKVLKT EMS QRGS GQCVI HY SKGFRWHQNL S L FYKDC F 

Important features of the protein: 
Signal peptide: 



cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 30-34 




N-glycosylation site. 



amino acids 1-22 



amino acids 294-298 




amino acids 67-76 



Tyrosine kinase phosphorylation site. 



N-myristoylation sites. 



amino acids 205-211, 225-231, 277-283 



Amidation site. 



amino acids 28-32 
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FIGURE 167 

CCAGGACCAGGGCGCACCGGCTCAGCCTCTCACTTGTCAGAGGCCGGGGAAGAGAAGCAAAGCGC 
AACGGTGTGGTCCAAGCCGGGGCTTCTGCTTCGCCTCTAGGACATACACGGGACCCCCTAACTTC 
AGTCCCCCAAACGCGCACCCTCGAAGTCTTGAACTCCAGCCCCGCACATCCACGCGCGGCACAGG 
CGCGGCAGGCGGCAGGTCCCGGCCGAAGGCGATGCGCGCAGGGGGTCGGGCAGCTGGGCTCGGGC 
GGCGGGAGTAGGGCCCGGCAGGGAGGCAGGGAGGCTGCATATTCAGAGTCGCGGGCTGCGCCCTG 
GGCAGAGGCCGCCCTCGCTCCACGCAACACCTGCTGCTGCCACCGCGCCGCGATGAGCCGCGTGG 
TCTCGCTGCTGCTGGGCGCCGCGCTGCTCTGCGGCCACGGAGCCTTCTGCCGCCGCGTGGTCAGC 
GGCCAAAAGGTGTGTTTTGCTGACTTCAAGCATCCCTGCTACAAAATGGCCTACTTCCATGAACT 
GTCCAGCCGAGTGAGCTTTCAGGAGGCACGCCTGGCTTGTGAGAGTGAGGGAGGAGTCCTCCTCA 
GCCTTGAGAATGAAGCAGAACAGAAGTTAATAGAGAGCATGTTGCAAAACCTGACAAAACCCGGG 
ACAGGGATTTCTGATGGTGATTTCTGGATAGGGCTTTGGAGGAATGGAGATGGGCAAACATCTGG 
TGCCTGCCCAGATCTCTACCAGTGGTCTGATGGAAGGAATTCCCAGTACCGAAACTGGTACACAG 
ATGAACCTTCCTGCGGAAGTGAAAAGTGTGTTGTGATGTATCACCAACCAACTGCCAATCCTGGC 
CTTGGGGGTCCCTACCTTTACCAGTGGAATGATGACAGGTGTAACATGAAGCACAATTATATTTG 
CAAGTATGAACCAGAGATTAATCCAACAGCCCCTGTAGAAAAGCCTTATCTTACAAATCAACCAG 
GAGACACCCATCAGAATGTGGTTGTTACTGAAGCAGGTATAATTCCCAATCTAATTTATGTTGTT 
ATACCAACAATACCCCTGCTCTTACTGATACTGGTTGCTTTTGGAACCTGTTGTTTCCAGATGCT 
G C AT AAAAG T AAAGGAAGAAC AAAAAC T AGT C CAAAC CAG T C T AC AC T GT GGAT T T C AAAGAG T A 
C CAGAAAA.GAAAG T GGCAT G GAAG T ATAAT AAC T CAT T G AC T T GG T T CC AGAAT T T T G T AAT T C T 
GGATCTGTATAAGGAATGGCATCAGAACAATAGCTTGGAATGGCTTGAAATCACAAAGGATCTGC 
AAGATGAACTGTAAGCTCCCCCTTGAGGCAAATATTAAAGTAATTTTTATATGTCTATTATTTCA 
TTTAAAGAATATGCTGTGCTAATAATGGAGTGAGACATGCTTATTTTGCTAAAGGATGCACCCAA 
ACTTCAAACTTCAAGCAAATGAAATGGACAATGCAGATAAAGTTGTTATCAACACGTCGGGAGTA 
TGTGTGTTAGAAGCAATTCCTTTTATTTCTTTCACCTTTCATAAGTTGTTATCTAGTCAATGTAA 
TGTATATTGTATTGAAATTTACAGTGTGCAAAAGTATTTTACCTTTGCATAAGTGTTTGATAAAA 
ATGAACTGTTCTAATATTTATTTTTATGGCATCTCATTTTTCAATACATGCTCTTTTGATTAAAG 
AAACTTATTACTGTTGTCAACTGAATTCACACACACACAAATATAGTACCATAGAAAAAGTTTGT 
TTTCTCGAAATAATTCATCTTTCAGCTTCTCTGCTTTTGGTCAATGTCTAGGAAATCTCTTCAGA 

AAT AAGAAGC T AT T T CAT T AAG T G T G AT AT AAACC T C C T CAAACAT T T T AC T T AGAG G C AAG GAT 
TGTCTAATTTCAATTGTGCAAGACATGTGCCTTATAATTATTTTTAGCTTAAAATTAAACAGATT 
TTGTAATAATGTAACTTTGTTAATAGGTGCATAAACACTAATGCAGTCAATTTGAACAAAAGAAG 
TGACATACACAATATAAATCATATGTCTTCACACGTTGCCTATATAATGAGAAGCAGCTCTCTGA 
GGGTTCTGAAATCAATGTGGTCCCTCTCTTGCCCACTAAACAAA.GATGGTTGTTCGGGGTTTGGG 
ATTGACACTGGAGGCAGATAGTTGCAAAGTTAGTCTAAGGTTTCCCTAGCTGTATTTAGCCTCTG 
ACTATATTAGTATACAAAGAGGTCATGTGGTTGAGACCAGGTGAATAGTCACTATCAGTGTGGAG 
ACAAGCACAGCACACAGACATTTTAGGAAGGAAAGGAACTACGAAATCGTGTGAAAATGGGTTGG 
AACCCATCAGTGATCGCATATTCATTGATGAGGGTTTGCTTGAGATAGAAAATGGTGGCTCCTTT 
CTGTCTTATCTCCTAGTTTCTTCAATGCTTACGCCTTGTTCTTCTCAAGAGAAAGTTGTAACTCT 
CTGGTCTTCATATGTCCCTGTGCTCCTTTTAACCAAATAAAGAGTTCTTGTTTCTGGGGGAAAAA 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 



App_ID=10063553 



Page 319 of 320 



FIGURE 168 



MSRWSLLLGAALLCGHGAFCRRWSGQKVCFADFKHPCYKMAYFHELSSRVSFQEARLACESE 
GGVLLSLENEAEQKLIESMLQNLTKPGTGISDGDFWIGLWRNGDGQTSGACPDLYQWSDGSNSQ 
YRNWYTDEPSCGSEKCVVMYHQPTMTPGLGGPYLYQWNDDRCNMKHNYICKYEPEINPTAPVEK 
PYLTNQPGDTHQ^^VWTEAGIIPNLIYWIPTIPLLLLILVAFGTCCFQMLHKSKGRTKTSPNQ 
S T L W I S KS TRKE S GMEV 

Important features of the protein: 
Signal peptide: 

amino acids 1-21 

Transmembrane domain: 

amino acids 214-235 

N-glycosylation sites. 

amino acids 86-89, 255-258 

cAMP- and cGMP -dependent protein kinase phosphorylation site. 

amino acids 266-269 

N-myristoylation sites. 

amino acids 27-32, 66-71, 91-96, 93-98, 102-107, 109-114, 140- 
145, 212-217 
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